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THE  COUPLED  HOLE  PHONON  SYSTEM  AND 
MINORITY-ELECTRON  TRANSPORT  IN  p-GaAs 
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(Rcceiifd  16  July  1991;  m  rnurd  form  U  January  1992/ 

Abstract— We  present  a  Monte-Carlo  studs  of  minonty-eleciron  transport  in  p-Oa.As  including  dynamic 
screening  and  plasmon -phonon  coupmg  using  a  phenomenologically  damped  quasi-nonequilihnum 
two-band  dielecinc  function.  Calculated  mode  spectra  contain  the  acoustic  and  optical  plasmons  of  the 
tvso-component  hole  plasma,  and  suggest  that  despite  the  role  of  the  acoustic  plasmon,  plasma  oscillations 
do  not  extend  to  wave  vectors  as  large  as  the  inverse  screening  length  Furthermore,  we  find  no  evidence 
of  the  drag  effect  For  heavy  dopings,  calculated  high-field  minority-electron  drift  velocities  are 
significantly  larger  than  experimental  val’ies.  indicating  that  further  work  is  required  to  achieve  a  true 
understanding  of  minority-electron  transport 


t.NTRODLCTlON  plasmon  cut-off  wave  vector  9c  =  ‘  was  required  to 

obtain  agreement  with  experimental  values  Interest- 
During  the  last  few  years,  minority  electron  transport  ingly,  this  relation  differs  from  that  used  by 
has  been  the  subject  of  a  number  of  experimental  and  Taniyama  et  at..  9c  =  2rtr/',  It  appears  then  that 
theoretical  investigations,  not  only  because  of  its  there  is  no  consensus  on  what  the  proper  theory  for 
relevance  to  bipolar  devices,  but  also  because  of  its  minority-electron  transport  is.  In  fact,  there  may 
importance  as  an  interesting  problem  in  semiclassical  be  disagreement  over  what  constitutes  a  theoretical 
transport  theory.  The  first  measurements  of  minority-  explanation  of  experimental  data, 
electron  velocity-field  characteristics!  1].  performed  As  noted  in  our  earlier  work[3).  a  number  of 
on  lightly-doped  Ino.jGancjAs,  suggested  that  the  theoretical  issues  deserve  further  investigation, 
high-field  drift  velocity  of  minority  electrons  is  among  them  the  effect  of  dynamic  screening  and 
significantly  larger  than  that  of  majority  electrons,  plasmon-phonon  coupling  in  the  two-band  hole  sys- 
Later.  Monte-Carlo  simulations  of  Osman  and  tern.  Despite  the  contributions  of  a  number  of  inves- 
Grubin[2]  indicated  that  statically  screened  electron-  tigators(5-IO],  a  comprehensive  treatment  of  dynamic 
heavy-hole  interactions  do  not  account  for  sufficient  screening  and  plasmon-phonon  coupling  is  still  tack- 
minoriiy-electron  energy  loss  to  explain  large  high-  ing.  To  us.  the  use  of  a  cut-off  wave  vector,  an 
field  velocities  at  low  hole  concentrations.  They  did,  artificial,  vaguely  defined,  yet  sharp  dividing  line 
however,  show  larger  high-field  velocities  for  min-  between  excitations  of  collective  and  single-particle 
orily  electrons  than  majority  electrons,  Sadra  et  character,  is  fundamentally  dissatisfying.  Given  the 
a/.{3, 4|  took  statically-screened  multiband  processes  inherent  vagueness  in  the  position  of  the  cut-off.  its 
into  account  and  found  that  although  minority-elec-  value  can  be  adjusted  to  fit  any  given  measurement, 
iron  energy  loss  was  enhanced,  it  was  still  too  small  The  treatment  of  these  excitations  is  even  less  appeal- 
to  explain  the  measurements.  Subsequent  calcu-  ing  when,  as  is  usually  the  case,  the  plasmon  is  taken 
lations  by  Saito  et  £i/.[5]  also  showed  larger  minority-  to  extend  to  the  cul-off  wave  vector  undamped  and 
electron  drift  velocities  than  majority  electron  values,  undispersed.  We  believe  that  a  theory  of  minority 
In  1990,  new  experimental  results(6]  showed  that,  at  electron  transport  should  not  rely  on  a  cut-off  wave 
least  in  GaAs.  the  high-field  minority-electron  drift  vector,  particularly  if  calculated  results  are  sensitive 
velocity  is  in  fact  lower  than  the  majority-electron  to  its  value.  Fortunately,  the  frequency-  and  wave- 
value,  Taniyama  et  al.[l\  performed  Monte-Carlo  vector-dependent  dielectric  function  provides  a  natu- 
calculations  of  minority-electron  velocity-field  ral  framework  for  removing  the  cut-off  wave  vector, 
characteristics  and  found  reasonable  agreement  with  The  dielectric  function  gives  the  complete  spectrum  of 
measurements.  However,  the  agreement  was  obtained  plasma  excitations,  from  collective  to  single-particle, 
by  replacing  the  heavy-hole  effective  mass  with  the  with  dispersion  and  damping,  for  all  wave  vectors.  As 
light-hole  value  in  the  calculation  of  the  screening  the  wave  vector  increases  and  the  plasma  can  no 
length.  More  recently,  Lowney  and  Bcnnett[8]  calcu-  longer  sustain  collective  oscillations,  modes  gradually 
lated  the  minority-electron  mobility  and  found  that  a  disappear  in  the  calculated  spectra.  The  dielectric 
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function  is  also  a  natural  vehicle  for  including  the 
often-neglected  effects  of  light  holes,  among  them  the 
rise  in  the  plasma  frequencs  and  the  appearance  of 
the  acoustic  plasmon. 

In  this  paper,  vve  report  bipolar  Monte-Carlo 
simulations  of  steady-state  minority-elcc'ron  trans¬ 
port.  including  dynamic  screening  and  plasmon- 
phonon  coupling  given  by  a  phenomenologically 
damped  quasi-nonequiiibnum  two-band  dielectric 
function.  A  similar  treatment  has  been  recently  used 
in  connection  with  electron  transport  through  an 
equilibrium  hole  plasma  in  heterojunctton  bipolar 
transistors!  1 1).  However,  it  has  not  been  extended  to 
the  problem  of  interest  here,  that  of  steady-state 
minority-electron  velocity-field  characteristics.  We 
begin  with  a  presentation  of  the  theoretical  formu¬ 
lation  and  the  relevant  computational  issues,  and 
then  proceed  to  describe  the  results  and  discuss  their 
implications. 


where  the  maximum  is  with  respect  to  q,  and  then 
transforming  to  k'  k  i-  q,  we  find  an  upper  bound 
on  the  scattering  rate  given  by. 
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Following  the  standard  technique! 1 2, 13)  we  omit 
the  degeneracy  term  1  -  /fk  )  from  the  integrand 
and  include  it  directly  in  the  Monte-Carlo  simulation. 
Integrating  over  Aoi,  transforming  the  integral  over 
k'  to  spherical  coordinates  with  the  polar  axis  along 
k,  and  integrating  over  the  aximulhal  angle  we 
have: 


2.  theoretical  formllation 


The  Monte-Carlo  model  employed  in  this  study  is 
identical  to  that  described  in  our  earlier  work[41 
except  for  the  addition  ol  degeneracy,  and  the  in¬ 
clusion  of  dynamic  screening  and  plasmon-phonon 
coupling  in  lieu  of  statically  screened  multiband 
carrier-carrier  and  polar  carrier-optical-phonon 
interactions.f  Degeneracy  is  included  in  the  simu¬ 
lation  following  the  standard  technique!12, 13).  As 
for  dynamic  screening  and  plasmon-phonon  coup¬ 
ling.  our  approach  is  to  use  the  simplest  possible 
formalism,  while  maintaining  frequency  and  wave- 
vector  dependence,  as  well  as  some  measure  of  exter¬ 
nal  damping.  The  treatment  is  described  below. 

For  a  carrier  with  wave  vector  k  and  energy  £,  the 
rate  of  scattering  by  the  coupled  phonon-carrier 
system  is  given  by  114): 


X  ln{ai)^  IJ-rl'n  —7 - : 

r  L<t(<|.  w)J 

x5F(k,k-t-q)^!£(k-t-q)-£)k)-+-ft<ul.  (1) 


where  w)  is  the  total  dielectric  function  of  the 
crystal  and  in  the  standard  approximation,  the  over¬ 
lap  factor  has  the  form: 


=  X(k,,k')  -f-  Y(k,k‘)cosi 

-f  Z(k,/t')cos'^,  (2) 

and  i  is  the  angle  between  k  and  k'.  We  use  the  usual 
values  for  X,  Y  and  Z  for  electrons  and  holes!3,4]. 
Defining:  ' 


'tThe  material  parameters  used  in  this  study  are  identical  to 
those  used  in  our  earlier  work,  except  for  the  new 
parameters  reported  in  this  text. 


X(k,k')+  Y(k.k')coi  i  Zik.k  )cos 


1_  k' +  k  ' -  2kk' coi  i 

x!nl(£*-  £.  )  h]  +  \]-\{E,  -  E,  )  h).  (5) 

Integrating  over  cos  4  and  transforming  back  to 
w  =  Eik')  -  E(k).  we  find: 
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(6) 


where 

k'(w)  =  v'2m*'(£-ka>)Il  +%'{£- hu))]lh.  (7) 

Pre-  and  post-scattering  effective  masses  and  non- 
parabolicities  m*.  m  a  and  *'  allow  the  application 
of  eqn  (6)  to  electrons  in  nonparabolic  bands  and  to 
inter-  and  intraband  scattering  of  holes. 

In  the  random  phase  approximation,  the  total 
dielectric  function  of  a  semiconductor  can  be  written 
as  the  sum[15}; 


ttfq,  w)  =  £i(q.  to)  -+-  f,(q.  w)  +  fh(q,  w).  (8) 


where  t,  and  £^  are  the  contributions  of  free  electrons 
and  holes,  respectively.  The  contribution  of  the 
crystal  in  the  absence  of  free  carriers  isflfi): 


w[o  —  ~  '•Wf 

~  iojy  ' 


(9) 


where  y  is  the  inverse  of  the  nonelectronic  LO- 
phonon  lifetime,  which  we  take  to  be  4.0  ps  at  room 
temperature!  1 7).  Treating  the  limit  of  low  minority 


Theory  of  iranipori  in  p  i 
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U! 


electron  concentratior-,  -ve  ignore  the  free  electron 
contribution.  For  the  hole  contribution,  we  use(18i: 

rJq.tu)  =  ^lVG- 
~ 

•ik)-,/:(k+q) 

X  - - ^  — -  ,  ,10) 

f  1  k  —  q  i  —  £,  ( k )  —  h(j  —  ihx 

where  indices  i  and  span  the  heavy-  and  iight-hole 
bands.  Note  that  the  o  •  erlap  factor  <7  is  twice  as  large 
as  the  standard  Monte-Carlo  overlap  factor  ’’4  since 
G  and  'S  are  summed  ^nd  averaged  over  initial  spin 
states,  respectively  Fo.-  a  collisionless  plasma,  a-»0 
and  eqn  (10)  already  includes  intrinsic  or  Landau 
damping.  For  a  plasma  of  particles  undergoing  scat¬ 
tering,  extrinsic  or  coliisional  damping  should  be 
included  as  well.  A  simple  and  widely-used  approach 
is  to  include  collisional  damping  by  interpreting  x  as 
a  phenomenological  damping  term  of  the  order  of  the 
inverse  relaxation  time  In  our  calculations,  we  take 
X  to  be  5  X  10'' s  S:2nificantly  larger  values  are 
unphysical;  values  up  to  an  order  of  magnitude 
smaller  do  not  produce  appreciably  diflferent  min¬ 
ority-electron  results;  and  much  smaller  values  are 
unphysically  small  and  cause  severe  difficulties  in 
numerical  integration  of  eqn  (10). 

The  theory  outlined  above  is  applicable  to  equi¬ 
librium  hole  plasmas.  Here,  to  obtain  some  measure 
of  the  effect  of  the  nonequilibrium  nature  of  the  hole 
plasma,  we  take  a  quasi-nonequilibrium  approach: 
we  use  the  simulated  nonequilibrium  hole  distri¬ 
bution  functions  in  eqn  ( 10).  This  seemingly  benign 
complication  creates  inordinate  difficulties.  Evalu¬ 
ation  of  eqn  (10)  for  a  numerically-known  simulated 
distribution  function  is  extremely  difficult,  particu¬ 
larly  for  small  i.  Furthermore,  in  spherical  bands,  the 
spherical  symmetry  of  equilibrium  distribution  func¬ 
tions  allows  us  to  calculate  only  as  a  function  of  (» 
and  |q|.  Under  an  applied  field,  however,  only  cylin¬ 
drical  symmetry  remains,  necessitating  evaluation  of 
as  a  function  of  c  j  and  two  components  of  q,  say 
Iql  and  T,  where  Tis  the  polar  angle  of  q  around  the 
direction  of  the  applied  field. +  The  problem  is  com¬ 
pounded  by  the  fact  that  numerical  integration  is  not 
entirely  suited  to  efficient  vector  processing.  Fortu¬ 
nately,  if  all  additive  terms  in  eqn  (10)  are  separated 
and  regrouped  according  to  the  band  i  of /(k)  and  the 
integral  over  k  is  transformed  to  spherical  coordi¬ 
nates  with  q  as  the  polar  axis,  the  azimuthal  integral 
becomes  independent  of  ;ql  and  ai  and  the  remaining 
multiplicative  portions  of  the  integrand  become  inde¬ 
pendent  of  the  distribution  function.  A  vector 
double-integration  algonthm  can  then  be  used  to 
evaluate  Ch  3t  a  few  thousand  points  (jq|, ai)  at  the 
same  time.  Each  time  the  integrands  are  required  at 
a  given  point,  the  azimuthal  integral  is  evaluated  only 

tNote  also  that  the  property  fh(— q,  -o>)  -  pro¬ 

vides  a  factor  of  two  reduction  in  the  required  compu¬ 
tational  effort. 


once  and  the  remaining  portions  of  all  the  integrands 
arc  cakiil.ncd  in  a  vectorized  lashion  !i  is  ihen 
possible  i‘>  perform  the  complete  calculation  of  ihc 
dielccliK  lunction  at  an  average  rate  of  i  50  M FLOPS 
on  a  smi-le  CRAY  YMP  processor  Higher  FLOP 
rates  ai«-  attainable  for  very  smooth  distribution 
function'  or  at  the  cost  of  larger  memory  We 
calculate  f®r  q  3  *  10  cm  /'=.■?•  10 

with  1=^1,  .  ,10  and  where 

=  6.0(i4o  ^  single-hole  simulaiion  i'  first  per¬ 
formed  to  obtain  the  steady -state  light-  and  heavy- 
hole  distributions.  Periodically,  the  simulation  is 
interrupted  and  and  the  associated  upper  bound  on 
the  hole  scattering  rate  are  reevaluated  using  the 
converging  distribution  functions  Only  intraband 
terms  are  included  in  the  calculation  of  in  the  hole 
simulations  to  avoid  problems  associated  with, 
crudely  speaking,  identifying  the  scattering  holes 
interchanging  bands  with  the  scattered  holc()9)  In 
the  siaiically-screened  pair-scaticnng  piciure.  it  is 
easy  to  find  the  single  scattering  hole  and  change  ns 
state.  In  the  present  dynamic  ireaimcni.  however,  the 
identities  of  the  holes  involved  in  the  scattering  of  the 
scattered  hole  arc  somewhat  obscured  and  further 
mixed  by  the  damping  term.  If.  in  the  single-pair 
picture,  most  interband  processes  are  viewed  as  essen¬ 
tially  a  light  hole  and  a  heavy  hole  exchanging  states, 
the  net  effect  can  be  regarded  as  small  It  should  be 
clear  that  ignoring  the  interband  contribution  to  the 
dielectric  function  is  not  equivalent  to  ignoring  inter- 
band  hole  scattering  events.  Interband  hole  scattering 
is  included,  as  in  our  previous  papers,  for  lonized- 
impurity  scattering,  acoustic  deformation-potential 
scattering,  and  nonpolar  optical-phonon  scattering. 
Also  included  are  interband  holc-coupled-modc  scat¬ 
tering  events  in  which  the  scattering  of  the  Monte- 
Carlo  hole  does  not  involve  inlerband  transitions  of 
other  holes,  but  involves  phonons  and  or  iniraband 
transitions  of  other  holes.  It  will  be  shown  below  that 
wnthin  our  treatment,  the  details  of  the  hole  distri¬ 
bution  do  not  appear  to  affect  minority-electron 
results  appreciably.  As  such,  ignoring  the  interband 
contribution  to  the  hole  dielectric  function  in  hole 
simulations  is  essentially  inconsequential. 

The  fintil  hole  distributions  obtained  from  the  hole 
simulations  are  used  to  calculate  including  both 
intra-  and  interband  terms,  and  the  upper  bound  on 
the  minoiity-clectron  scattering  rate  given  by  eqn  (6). 
for  subsequent  minority-electron  simulations.  Inter- 
band  terms  are  no  longer  problematic  since  a  low 
concentration  of  minority  electrons  will  not  affect  the 
majority  hole  distribution  significantly,  making  the 
identification  of  scattering  holes  unnecessary.  During 
both  electron  and  hole  simulations,  the  coupled-mode 
energy  is  first  chosen  according  to  the  distribution 
defined  l>v  the  integrand  of  eqn  (6).  This  is  done  using 
a  search  inble  containing  values  of 

/(n)  =  j*  do)[n(oj)  +  l]E(oi)  (I!) 
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and  the  rejection  technique  for  the  rest  ol  ilie  inte¬ 
grand  for  cchich  an  upper  bound  is  tabulated  lor  all 
£.  The  mode  energc  uniquely  deicrmincs  the  salnc  v'f 
k  .  Nest,  the  polar  angle  4  is  chosen  according  u<  lUe 
distribution  given  In  the  4 “tiependent  part  or  ihc 
integrand  in  eqn  tN  The  azimuthal  angle  iv  uni¬ 
formly  distributed  between  0  and  2n.  Tinally.  the 
exact  contribution  ot  ihic  event  to  the  rate  integral  is 
evaluated  using  the  dicicvinc  function,  and  the  rejec¬ 
tion  technique  is  uced  tc'  decide  whether  or  not  the 
scattering  event  occurs. + 


3.  RESl  LTS  AND  DISC  LSSION 

We  begin  by  considering  the  major  features  of  the 
spectrum  of  the  mode  sirengths(20]  Im[  -  l  .A  lq,  tu>  of 
the  light-  and  heavy-hole  plasma,  where  k  c„. 
Figure  I A  depicts  the  mode  strength  spectrum  for  the 
ftctitious  case  of  equal  light  and  heavy  hole  concen- 
tratio.ns.  p.  =  p,  =  10  cm  For  simplicity  and  clar¬ 
ity.  we  have  used  room- temperature  Fermi-Dirac 
distributions  and  ignored  overlap  factors  and  intcr- 
band  excitations.  The  modes  of  the  system  are  those 
of  a  two-component  plasma[21]:  a  heavily-damped 
acoustic  mode  with  linear  dispersion  for  small  q  and 
an  optical  mode  at  the  optica!  plasma  frequency 
cviop  =  \  fiy  -  coj, .  where  u;,  =  e'-p,  t  ^  and 
c;;.  =  c  pn  ( ,  m* .  Due  to  the  large  light-hole  concen¬ 
tration.  viqp  is  sufficiently  high  to  allow  the  two 
modes  to  be  widely  separated  and  distinct,  As  the 
light-hole  concentration  decreases  |Fig.  IB),  uop 
decreases  and  the  two  modes  approach  each  other. 
For  a  realistic  mix  of  light  and  heavy  holes, 
p.  =  (m*  ‘p  and  n.  -*-  p  —  10”  cm  ‘ '.  furth'r  de¬ 
crease  in  v3qp  together  with  phenomenological  damp¬ 
ing  results  in  a  joining  of  the  two  modes  into  the 
characteristic  two-lobe  spectrum  of  Fig.  1C.  The 
presence  of  the  acoustic  plasmon  then  serves  to 
extend  plasma  modes  to  somewhat  higher  wave  vec- 
tor»  It  should  be  noted  that  because  of  their  small 
effective  mass,  and  despite  their  relatively  small  con¬ 
centration,  light  holes  raise  the  plasma  frequency  by 
some  20%  and  should  therefore  not  be  ignored. 
Addition  of  overlap  factors  docs  not  alter  the  spec¬ 
trum  of  Fig.  1C  significantly.  Figure  ID  depicts  the 
effect  of  the  interband  terms,  which  result  in  a  slight 
shift  of  the  peak  to  lower  energies,  an  overall  ex¬ 
pression  of  its  strength,  significant  broadening,  and 
the  appearance  of  a  high-energy  tail  due  to  interband 
transitions. 


*We  have  performed  simulations  in  which  the  free-carrier 
contribution  to  the  dielectric  function  is  ignored.  The 
resulting  scattering  rales,  drift  velocities,  etc,  are  in 
excellent  agreement  with  those  obtained  in  simulations 
using  the  traditional  phonon  scattering  treatment 
Jit  is  imperative  to  view  the  phonon  line  with  sufficient 
energy  resolution  as  to  avoid  erroneous  conclusions 
concerning  its  behavior. 


I  he  c-'upiio-.'t.wdc  spectra  shown  tn  Fig  2  are 
.■•'lainea  trorr.  .'loic  simulations  lor  <*  =.  12  k\  cm 
.-.  -.1  p  =  |i-i  cm  .  as  described  above  W  ithoul  inter- 
■.■Aiid  terms,  the  mode  spectra  exhibit  a  small  degree 
aniscsir-’py  W  ith  the  addition  of  ir.tcrband  terms. 
!■,.•wevef.  antss'trsspic  effects  are  all  but  overwhelmed 
In  I'aci  minv’nty -electron  simulations  performed  with 
.  m-'t)  -ind  t-i  q.mi  give  very  similar  results, 
indicating  that  wiihiii  ihis  treatment,  minority  elec- 
11, ms  are  msi  seriously  affected  by  the  direction  in 
wtiiih  ihe  holes  travel  This  rather  interesting  results 
IS  m  agreement  with  the  observations  of  Lovtjoy 
,1 .1/  (22]  concerning  the  apparent  absence  of  a  signifi- 
.  .uU  drag  effect  on  the  minority -electron  mobility, 
.ilihough  the  absolute  value  of  the  mobilities  calcu- 
l.itcd  here  arc  higher  than  the  measured  values  (see 
Ivlow)  Low -temperature  experiments  on  modu¬ 
lation-doped  quantum  wells  have  shown  that  if  min¬ 
ority  electrons  are  effectively  decoupled  from 
phonons  and  ionized  impurities  and  only  interact 
with  majority  holes,  and  if  the  majority  hole  distri¬ 
bution  IS  sufficiently  anisotropic,  minority  electrons 
are  drastically  affected  by  the  motion  of  the  holes, 
even  to  the  point  of  exhibiting  negative  mobility  at 
low  ficld5[231  As  we  will  see,  at  p  =  IO”cm  ’. 
phonon  scattering  is  quite  weak  Within  the  present 
treatment,  strong  static  screening  reduces  the  import¬ 
ance  of  lonized-impurity  scattering  as  well,  .Minority 
electrons  are  then  coupled  mostly  to  the  hole  gas, 
although  not  as  exclusively  as  the  case  above  Also, 
the  hole  distribution  is  only  weakly  anisotropic,  even 
at  !2kV  cm  '  It  remains  to  be  seen  whether  or  not 
a  more  sophisticated  treatment  of  the  drifting  nature 
of  the  hole  plasma  produces  different  results, 

Tvp.tal  .node  strength  spectra  for  p  =  1.5  x  10'’, 
1.,‘i  X  10'*  and  I0”cm  ’  are  shown  in  Figs  3~5,  We 
begin  by  considering  the  long-wavelength  region.  At 
p=  1.5x10 'em  '.  the  iow-energy  plasmon-like 
mode  appears  at  more  or  less  the  optical  plasma 
frequency  and  the  high-energy  mode  is  slightly 
above  lu,,,.  As  the  concentration  increases  to 
1.5  X  lO'^cm  '.  both  the  plasmon-like  and  pnonon- 
like  modes  rise  in  energy.  At  p  =  lO”  cm  ’,  however, 
the  uncoupled  optical  plasma  frequency  is  already 
above  to^o-  resulting  in  a  phonon-like  mode  below 
wio  and  a  plasmon-like  mode  above  it.  In  (he  small- 
wavelcngth  limit,  screening  effects  are  negligible  and 
Ihc  small-wavelength  phonon-like  mode  approaches 
an  unscreened  phonon.  Figure  3  demonstrates  this 
behavior  dearly,  indicating  some  screening  of  the 
phonon  line  at  intermediate  wavelengths. J  As  the 
concentration  rises,  the  phonon  line  is  increasingly 
screened,  although  it  still  approaches  the  unscreened 
case  for  sufficiently  large  q.  It  should  be  noted  that 
plots  of  lm(- I  k- (q,  <-•;)]  do  not  give  a  direct  indi¬ 
cation  of  the  effectiveness  of  various  modes  in  elec¬ 
tron  scattering.  As  eqn  (1)  indicates,  the  long-range 
nature  of  the  Coulomb  term  q  ■  results  in  the 
disproportionate  importance  of  the  small-^  region. 
fF.qually  important  are  the  effects  of  the  boson 


Im  (  -1  /ic  (  q  ,0) )  I 


Im  (  -1 /K  (  q  ,ti>  >  I  \  Im  I  -I /K  (  q  ,(i) )  I  \  im  t -I /K  (  q  ,u> )  J 
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Fig,  2.  The  coupled-mode  strength  spectrum  for  p  =  iC’cm  ‘  and  -  12  kV  cm  '  without  (Aland  with 
(B)  interband  contributions.  Positive  and  negative  ^ -values  are  parallel  and  antiparailel  to  the  applied  field, 

respective!) 


population  favoring  small-o  events  and  the  require¬ 
ments  of  energy  and  momentum  conservation.!  At 
p  «  10” cm’’,  then,  phonon  scattering  is  expected  to 
be  essentially  inoperative. 

In  Fig.  6,  we  have  plotted  the  simulated  minority- 
electron  scattering  rate  density,  the  rate  of  minority- 
electron  scattering  by  the  coupled  hole-phonon 
system  per  unit  mode  energy.  Major  features  of  the 


mode  spectra  of  Figs  3-5  also  appear  In  Fig.  6.  The 
scattering  rale  density  exhibits  distinct  features  corre¬ 
sponding  to  the  plasmon-like  modes,  which  increase 
in  energy  with  the  concentration.  The  effect  of  the 
phonon-like  modes  diminishes  as  the  concentration  is 
increased.  Due  to  the  boson  population,  there  are 
many  scattering  events  at  small  m.  Figure  7  depicts 
the  minority-electron  energy  loss  rate  density,  the  rate 


0 
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Fig  3,  The  coupled-mode  strength  spectrum  for  p  *  1.5  x  lO'  cm  =  12  kV cir  '  and  }'»  n. 


of  minority-electron  energy  loss  to  the  coupled 
hole-phonon  system  per  unit  mode  energy.  Due  to 
the  small  energy  involved  in  a  small-<u  event,  as  well 
as  the  relative  unimportance  of  spontaneous  emis¬ 
sion,  the  net  energy  loss  rate  density  for  small  co  is 
quite  well  behaved. 


Theoretical  and  experimental  velocity-field  charac¬ 
teristics  of  minority  electrons  are  shown  in  Figs  8-10. 
At  p  =  10’’ cm  ’,  disagreement  between  theoretical 
and  experimental  velocities  does  not  exceed  20%, 
which  is  comparable  to  the  general  range  of  exper¬ 
imental  uncertainties.  At  p  =  10"  and  10" cm*’. 
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Fig.  5  The  coupled-mode  jirenglh  jpecirum  for  p  «  10" cm"  -  12  kVcm  '  and  T-  a 


however,  theoretical  characteristics  are  in  striking  mcni  is  inadequate  The  large  cakulated  vetocioes 

disagreement  with  experimental  data,  exhibiting  sig-  are  a  consequence  of  strong  energy  relaxation,  pre- 

nificantly  larger  high-field  drift  velocities  We  focus  venting  the  heating  of  electrons  and  their  transfer 

on  these  higher  hole  concentrations  because  they  to  upper  valleys,  coupled  with  insufficient  forward 

provide  more  stringent  tests  of  the  ability  of  the  momentum  relaxation,  which  allows  the  development 

theory  to  desenbe  the  interactions  of  minority  elec-  of  a  highly  anisotropic  distribution  A  comparison 

tions  with  holes  It  is  reasonable  to  expect  that  once  of  the  theoretical  and  experimental  drift  velocities 

the  theory  is  able  to  explain  the  expenmciilal  data  at  of  Figs  9  and  10  suggests  that  the  present  treat- 

the  higher  hole  concentrations,  an  explanation  of  the  ment  results  in  significantly  smaller  forward  momen- 

data  at  p  =  10*’ cm"’  would  be  naturally  afforded,  turn  relaxation  than  what  is  required  lo  explain 

Although  expenmenial  problems  such  as  the  likely  the  mcasurcment.s.  Furthermore,  a  companson  of 

presence  of  large  quantities  of  interstitial  Be  at  the  peak  fields  suggests  that  the  c’lcuiated  minonty- 

p  =  10”  cm may  be  invoked  to  explain  away  some  electron  energy  loss  is  perhaps  somewhat  too 

of  the  discrepancy,  it  is  clear  that  the  present  treat-  large 


Fig.  6,  The  simulated  rate  of  scattering  of  r -valley  minofily  Fig  t  The  simulaied  rale  of  energy  loss  by  r -valley 
electrons  by  the  coupled  hole-phonon  system  per  unit  mode  minority  elecironi  lo  the  coupled  hole  phonon  system  per 
energy  forp  •>  1.5  x  I0'’cm" phonon  peak  labeled  A),  unit  mode  energy  p  »>  1  S  x  10"cm  '  (-i- ,  phonon  peak 

1.5  X  I0’*cm'’  (O.  phonon  peak  labeled  B)  and  10'* cm  '  labeled  A),  1.5  x  10'*cm  ’  (□.  phonon  peak  labeled  B)  and 

(•,  phonon  peak  labeled  C).  10'* cm  '  (•.  phonon  peak  labeled  C) 
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Fig  8  Velocity -field  characierisiics  of  minority  electrons 
for  />  =  V,‘  =  15a  10'' cm  '  The  line  is  experimental  data 
from  Ref  [6]  and  calculated  points  arc  marked 


Fig  10  Velocity  field  characteristics  of  minority  electrons 
for  p  =  A’,  -  10 ''  cm  ’  The  tine  is  expenmentai  data  from 
Ref  fO)  and  calculated  points  are  marked 


Our  heave -doping  results  are  essentially  similar  to 
those  cf  Saito  ei  a/.(5].  and  differ  from  those  reported 
by  Tanivama  ei  ij/,(7].  Sano  et  at.  used  a  somewhat 
simpler  treatment  and  calculated  high-field  drift  vel¬ 
ocities  far  in  excess  of  experimental  results.  Taniyama 
et  a/  [7]  argued  that  the  static  screening  length  of  the 
two-component  p-iype  system  is  not  equal  to  the 
heavy-hole  value  alone.  They  replaced  the  heavy-hole 
effective  mass  with  the  hght-hole  value  to  calculate 
the  proper  screening  length  and  obtain  agreement 
with  experiment  We  believe  that  this  procedure  is 
unsatisfactory.  The  correct  static  screening  length  of 
the  two-component  system  for  arbitrary  degeneracy 
is  quite  easy  to  calculate,  and  has  been  used  by  us  in 
this  work  and  in  previous  publications(.Tj.  Both  light- 
and  heavy-hole  contributions  are  included,  and  the 
inlerband  contribution  is  zero  due  to  the  orthogonal¬ 
ity  of  same-i(c  states  in  the  two  bands.  The  heavy-hole 
contnbuiion  is  dominant,  indicating  effective  static 
screening  by  heavy  holes.  Therefore,  although  the 
light-hole  contribution  is  not  included  in  most  papers 
and  the  treatment  is  not  always  extended  to  arbitrary 
degeneracy,  all  workers  employ  essentially  the  same 
theory  of  static  screening.  Furthermore,  the  screened 
potential  must  result  in  light-  and  heavy-hole  phase 


Fig.  9  Velocity-field  characteristics  of  minonly  electrons 
for  p  =  S i  =  1 ,5  X  10'*  cm  ’.  The  line  is  experimental  data 
from  Ref  (6)  and  calculated  points  are  marked 


shifts  that,  at  least  in  some  approximation,  satisfy  the 
Friedel  sum  rule  Not  every  screening  length  will 
satisfy  this  constraint 

In  Fig  1 1.  we  have  ploiicd  available  experimental 
values  for  (he  minority -electron  mobility,  together 
with  values  calculated  by  us  and  by  Lowney  and 
Benne(t{8)  Our  calculated  values  are  in  agreement 
with  the  results  of  Lowney  and  Bennett  for  the 
plasma  cut-off  wave  vector  =  0.5  .  To  the  extent 

that  mobility  values  alone  can  be  used  as  indicators, 
our  treatment,  which  includes  dispersion  and  intrinsic 
and  extrinsic  damping,  suggests  that  plasma  modes 
extend  only  so  far  in  wave  vector  as  to  correspond 
to  an  effective  cut-off  at  O.Sr^'  Accordingly, 
although  extending  some  effective  cut-off  to  r; '  may 
be  eventually  borne  out  by  an  improved  treatment  of 
the  dielectric  function,  it  is  not  supported  by  the 
present  treatment.  This  is  not  necessarily  surprising. 
The  argument  that  a  plasma  cannot  support  oscil¬ 
lations  with  wavelengths  much  shorter  than  the 
screening  length  does  not  necessarily  mean  that  it 
does  support  undamped  and  undispersed  oscillations 
all  the  way  down  to  the  screening  length.  Extrinsic 
damping  can  destroy  plasma  oscillations  at  shorter 


acceptor  concent  rat  km  (  cm  •*  ) 


Fig.  1 1  Theoretical  and  experimental  values  of  the  min¬ 
ority-electron  mobility.  Emp;y  squares  are  experimental 
pomts{24|.  filled  circles  are  our  calculated  results  and  dashed 
and  solid  lines  are  Lowney  and  Bennett's  value  for 
</,  =  0.5r,  '  and  r,  ',  re$pectively[8]. 
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wave  vectors  Even  in  the  collisionless  case,  the 
statement  q,  =  r ;  may  not  rest  on  firm  ground.  In  a 
/7-type  plasma,  the  screening  length  is  essentially 
determined  by  heavy  holes,  whereas  light  holes 
and  imerband  transitions  provide  additional 
channels  for  intrinsic  damping  at  shorter  wave  vec¬ 
tors.  We  believe  that  these  issues  are  strong  incentives 
for  pursuing  calculations  of  the  complete  mode 
spectra  using  progressively  improved  dielectric  func¬ 
tions. 

We  now  turn  to  a  critical  evakaiion  of  the  treat¬ 
ment  used  in  this  work,  and  attempt  to  chart  the  road 
for  future  improvements.  When  all  electrons  are 
confined  to  the  f -valley,  as  in  the  =  lO'^cm  ' 
case,  the  drift  velocity  is  affected  by  two  major 
processes:  scattering  from  the  coupled  hole-phonon 
system  and  lonized-impurity  scattering.  As  noted 
above,  our  strategy  has  been  to  use  the  simplest 
possible  treatment  for  the  former,  while  maintaining 
full  frequency  and  wave  vector  dependence.  An  im¬ 
portant  limitation  of  this  formulation  is  the  phe¬ 
nomenological  treatment  of  the  effect  of  scattering  on 
the  dielectric  function.  Since  an  accurate  and  well- 
grounded  analysis  of  the  damping  of  plasma  modes 
and  the  effect  of  scattering  on  screening  may  welt 
be  central  to  the  problem  at  hand,  we  believe  that 
one  should  proceed  directly  to  a  true  relaxation- 
time  treatment  of  the  dielectric  function,  or  perhaps 
go  beyond  the  relaxation-time  approximation 
altogether.  Raman  measurements  can  serve  as  a 
valuable  guide  for  progress  in  this  area.  Equally 
important  is  the  development  of  a  nonperturbative 
theory  for  the  interactions  of  carriers  with  the  dynam¬ 
ically-screened  hole-phonon  system.  More  accurate 
hole  overlap  factors  could  also  be  beneficial.  As  the 
hole  concentration  increases  to  lO'^cm  scattering 
events  with  larger  energy  and  momentum  exchange 
become  prominent  Smaller  hole  overlap  factors 
could  reduce  the  relevant  contributions  to  the  dielec¬ 
tric  function  An  apparently  unappreciated  property 
of  Kane’s  overlap  factors(25|  is  their  troubling  behav¬ 
ior  as  q-‘0  with  nonzero  This  property  could  have 
an  adverse  effect  on  the  important  long  wavelength 
regions  of  Finally,  our  treatment  attempts  to 
gauge  the  importance  of  the  nonequilibrium  nature  of 
the  hole  plasma  by  simply  inserting  nonequilibrium 
distributions  into  an  equilibrium  theory.  Whereas  this 
is  a  reasonable  first  step,  it  is  clearly  not  the  ultimate 
objective.  Although  experimental  evidence  for  the 
apparent  absence  of  a  significant  drag  effect[22] 
suggests  that  the  nonequitibrium  nature  of  the  hole 
plasma  may  not  play  a  dominant  role  in  determining 
the  minority-electron  mobility,  a  fully  nonequi¬ 
librium  treatment  should  be  developed  in  due  course. 
It  is  also  known  that  as  the  impurity  concentration 
increases,  a  number  of  complications  arise  in  the 
theory  of  ionized-impurity  scattering(26].  As 
increasingly  sophisticated  models  for  minority-carrier 
transport  at  high  dopings  are  developed,  such  compli¬ 
cations  should  be  kept  in  sight. 


In  summary,  we  have  calculated  velocity -field 
characteristics  of  minority  electrons  in  p  ’  -GaAs  in¬ 
cluding  dynamic  screening  and  plasmon-phonon 
coupling  using  a  phenomenologically  damped  quasi- 
nonequiUbrium  two-band  dielectnc  function  Our 
results  suggest  that  under  conditions  of  interest  here, 
the  drifting  nature  of  the  hole  gas  does  not  affect  the 
minority-electron  drift  velocity  seriously.  Moreover, 
calculated  velocities  arc  significantly  higher  than  ex¬ 
perimental  values,  indicating  that  larger  forward 
momentum  relaxation  is  required  to  account  for  the 
experimental  data. 
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We  review  measurements  of  abrupt  transitions  in  III-V  compound  semiconductors  and 
heterostruciures,  including  abrupt  interfaces  as  found  in  quantum  wells  ( QWs)  and  high  electron 
mobility  transistors  (HEMTs),  and  abrupt  doping  profiles  such  as  delta  doping.  The  interface 
quality  of  QW  and  HEMT  structures  can  be  studied  using  photoluminescence  (PL).  Several 
methods  are  employed  to  characterize  the  abrupt  doping  profiles  of  delta-doped  structures.  Hall 
and  Shubnikov-de  Haas  measurements  are  used  to  extract  the  2D  carrier  concentrations  in  detail. 
C-V profiling  is  employed  to  study  the  carrier  profiles  normal  to  the  dopant  plane.  The  actual 
dopant  distribution  can  be  obtained  by  secondary  ion  mass  spectrometry  or  by  combining  C-V 
profiling  with  theoretical  calculations.  The  resolution  of  these  measuring  techniques  will  be 
discussed. 


I.  INTRODUCTION 

Abrupt  transitions  in  IIl-V  compounds,  including  abrupt 
heterojunctions  and  doping  profiles,  have  been  of  increasing 
importance  in  high  speed  electronic  and  optical  device  fabri¬ 
cation  and  design.  The  rapid  development  of  advanced  semi¬ 
conductor  growth  technologies,  such  as  molecular  beam  epi¬ 
taxy  (MBE)  and  metalorganic  chemical  vapor  deposition 
(MOCVD),  has  greatly  improved  the  crystal  quality  and 
reliability  of  materials.  This  leads  to  nearly  ideal  hetero¬ 
structures  with  abrupt  interfaces.  The  interface  roughness 
plays  a  dominating  role  in  determining  the  performance  of 
heterostructure  electronic  and  optoelectronic  devices.  To  in¬ 
vestigate  the  properties  of  the  abrupt  interfaces,  photolumi¬ 
nescence  (PL)  at  low  temperatures,  typically  liquid  helium 
or  liquid  nitrogen  temperature,  is  a  sensitive  and  powerful 
tool.'  "  The  PL  spectra  can  provide  information  about  the 
interface  quality  as  well  as  two-dimensional  electron  (hole) 
gas  (2DEG). 

Extremely  abrupt  doping  in  III-V  semiconductors  has 
been  demonstrated  by  delta-doping,  also  called  atomic  layer 
doping.  Abrupt  doping  with  donors  such  as  Ge,  ’  Si,^'^  and  Se 
(Ref.  6)  in  GaAs,  and  Si  (Ref.  7)  in  InGaAs,  has  been  stu¬ 
died,  and p-type  doping  using  C  (Ref.  8)  and  Be  (Ref.  9)  in 
GaAs  has  been  reported.  The  physics  and  applications  of 
delta  doping  in  III-V  compounds  and  heterostructures  are 
very  interesting  and  have  been  reviewed  in  several  recent 
articles.'"' '  In  this  doping  technique  the  dopants  are  ideally 
deposited  onto  a  single  plane  by  interrupting  epitaxial 
growth.  The  dopants  are  subject  to  diffusion  and  segregation 
during  growth,'^ ''  and  therefore  it  is  important  to  subse¬ 
quently  measure  the  doping  distribution.  Hall  measure¬ 
ments  of  delta-doped  semiconductors  have  shown  high  two- 
dimensional  carrier  concentrations''’  and  enhanced 
mobilities.  Shubnikov-de  Haas  (SdH)  measurements  assert 
the  existence  of  a  two-dimensional  electron  gas  in  the  delta¬ 
doping  layer, and  are  used  to  calculate  the  subband  carrier 
concentrations  associated  with  the  2D  system.'"  Along  the 
growth  direction,  the  carrier  profiles  are  extracted  from  ca¬ 


pacitance-voltage  measurements,  which  is  C-V  profiling. 
The  C-V  profile  widths  of  the  delta  doping  are  much  less 
than  the  Debye  length,  which  is  the  resolution  of  conven¬ 
tional  C-Fprofiling.  This  suggests  that  electrolyte  C-F pro¬ 
filing,  with  resolution  typically  larger  than  50  A,  is  not  useful 
for  such  abrupt  doping  profiles.  The  actual  dopant  profile 
widths  are  of  the  same  order  as  the  carrier  profiles,  as  veri¬ 
fied  by  secondary  ion  mass  spectroscopy  (SIMS)."  How¬ 
ever.  to  quantitatively  evaluate  the  actual  doping  distribu¬ 
tions,  theoretical  calculations  combined  with  measurements 
such  as  C-V  profiling.  Raman  spectroscopy,"*  and  SdH 
measurements,  are  still  under  study.  It  is  believed  that  the 
dopants  are  confined  in  the  regime  where  quantum  size  ef¬ 
fects  occur. 

II.  ABRUPT  INTERFACES 

III-V  compound  semiconductors  provide  a  variety  of  lat¬ 
tice-matched  heterostructures  with  abrupt  interfaces,  and  a 
range  of  band  discontinuities.  The  most  widely  studied  ter¬ 
nary  and  quaternary  alloys  are  in  the  InAIGaAs  and 
InGaAsP  systems,  and  most  recently  the  InGaAsSb  system. 
This  variety  of  materials  results  in  a  wealth  of  applications  in 
high  speed  electrical  and  optical  devices,  such  as  high  elec¬ 
tron  mobility  transistors  (HEMTs)  and  quantum  well 
lasers.  The  layer  abruptness  of  such  heterostructures  play  an 
important  role  in  determining  their  performance.  For  exam¬ 
ple,  in  HEMT  structures,  imperfection  of  the  interface  along 
which  the  2DEG  conducts,  reduces  the  mobility  of  the 
2DEG  due  to  the  interface  scattering  of  carriers.  For  optical 
devices  employing  quantum  well  structures,  interface 
roughness  as  well  as  layer  mixing  results  in  optical  spectral 
linewidth  broadening. 

Photoluminescence  ( PL )  is  a  nondestructive  technique  to 
measure  radiative  transitions  in  semiconductors,  and  is  espe¬ 
cially  useful  in  compound  semiconductors.  For  QW  struc¬ 
tures,  the  PL  spectra  can  extract  important  information 
such  as  recombination  mechanisms,^"  interface  roughness,* ' 
and  the  impurities  in  theQWs.*^  *'  The  lowest  subband  tran- 
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Fig.  1.  PL  spectra  from  four  quantum  wells  with  different  thickness.  Sam¬ 
ples  were  grown  b>  MBE  at  As  cracking  temperatures  of  increasing  order  of 
(a),  (b),  (c),  and  Id),  corresponding  to  the  increase  of  AsO.  Significant 
degradation  of  the  PL  spectra  is  observed  in  the  presence  of  AsO  while  the 
AsO  cannot  be  detected  by  RG A  ( Ref.  25 ) . 


sition  of  QWs  corresponds  to  a  distinct  peak  distinguished 
from  the  bulk  material  signals.'*'  This  peak  can  therefore 
accurately  determine  the  thickness  of  the  quantum  well.^"* 
The  linewidth  of  the  QW  peak  is  a  sensitive  index  of  the 
interface  roughness.  For  AlGaAs/GaAs  QWs  grown  by  mo¬ 
lecular  beam  epitaxy,  our  group  has  demonstrated  strong 
dependence  of  PL  spectra  on  the  QW  degradation  induced 
by  AsO  from  the  As,  cracking  source,"^  while  the  AsO  can¬ 
not  be  detected  by  residual  gas  analysis  (RGA).  Figure  1 
shows  broadening  of  linewidth  and  reduction  of  intensity  in 
PL  spectra,  as  a  function  of  AsO  content.  In  electrical  de¬ 
vices  using  HEMT  or  MDQW  (modulation-doping  quan¬ 
tum  well )  structures,  PL  spectra  can  provide  information  of 
the  two-dimensional  electron  (hole)  gas.^*  Our  group  has 
illustrated  the  relation  between  the  PL  spectral  linewidth 
and  the  2DEG  densities  for  AlGaAs/GaAs  (Ref.  27)  and 
pseudomorphic  AlGaAs/InGaAs/GaAs  MDQWs.^"  The 
FWHM  ( full  width  at  half  maximum )  of  the  77-K  PL  spec¬ 
tra  can  accurately  measure  the  2DEG  densities,  as  illustrat¬ 
ed  in  Fig.  2(a).  This  is  due  to  the  linewidth  broadening  in¬ 
duced  by  the  thermal  motions  of  carriers.  This  technique 
provides  a  nondestructive  method  to  extract  the  carrier  con¬ 


centrations,  as  verified  by  Hall  measurement.  The  ratio  of 
linewidths  at  77  and  4.2  K  is  linearly  dependent  on  the  Hall 
mobilities,  and  indicates  the  crystal  quality  since  the  4-K  PL 
spectra  is  sensitive  to  the  interface  scattering  while  the  77  K 
linewidth  is  not.  The  4.2-K  PL  spectra  can  also  predict  the 
Fermi  energy  position  in  the  ME)QWs. 

III.  DELTA  DOPING 

Delta  doping  in  compound  semiconductors  produces  the 
narrowest  doping  profile  reported  so  far.  and  has  realized 
novel  electrical  and  optical  devices,  such  as  ^-FETs  (delta- 
doped  field  effect  transistors),^’'^"  and  saw-tooth  superiat- 
tice  lasers.^'  Delta  doping  has  exhibited  2D  carrier  concen¬ 
tration  much  higher  than  that  achieved  by  uniform  doping. 
For  Si-doped  GaAs,  delta  doping  has  demonstrated’  2D 
electron  densities  >  1  X 10”  cm“^  as  cciupared  to  the  sa¬ 
turation  concentration  about  1.7X  10”  cm“^  in  uniformly 
doped  GaAs.  The  strong  localization  of  carriers  about  the 
dopant  plane,  as  shown  in  Fig.  3,  gives  a  strong  screening 
effect  of  the  ionized  impurity  scattering  to  the  free  carriers, 
resulting  in  enhanced  carrier  mobilities.  The  use  of 
delta  doping  in  modulation  doping  devices”-”  such  as 
MODFETs  has  demonstrated  improved  transconductance 
and  therefore  current  driving  capability  due  to  high  density 
2DEG,  as  verified  by  experiments  and  theories.’*"**  Delta 
doping  can  also  be  employed  in  submicron  devices  to  im¬ 
prove  short  channel  effects.  It  has  been  shown  in  conven¬ 
tional  HEMT  devices  that  the  carrier  mobilities  are  drasti¬ 
cally  reduced  at  high  electric  field,”  setting  a  lower  limit  on 
the  channel  length.  Devices  using  the  delta-doping  plane  as  a 
conduction  channel  such  as  ^-FETs,'”*  however,  suffer  much 
less  reduction  of  carrier  mobilities  at  high  field.  The  5-FETs 
show  better  performance  than  the  conventional  HEMT,  in 
submicron  channel  length  regime.  Recently,  delta  doping  in 
quantum  wells”  has  been  employed  in  FET  device  fabrica¬ 
tion*"  since  the  quantum  wells  provide  additional  carrier 
confinement,  yielding  improved  breakdown  voltages. 

IV.  CARRIER  CONCENTRATIONS 

The  2D  carrier  concentrations  in  the  delta-doping  layer 
are  of  great  importance  and  provide  the  current  flow  used  for 
devices.  The  understanding  of  the  2D  carrier  system  can  be 
of  great  help  in  optimizing  device  design  and  fabrication. 
Hall  effect  measurement  is  by  far  the  most  common  tech¬ 
nique  to  measure  free-carrier  densities  and  mobilities  in 
semiconductors  at  weak  magnetic  fields,*'  typically  of  the 
order  of  kilogauss.  Measured  Hall  concentrations  of  about 
10”  cm  "  ^  have  been  reported  for  Si  delta  doping  in  GaAs,’ 
as  compared  to  Hall  concentrations  of  <  1  x  10”  cm“^  for 
AlGaAs/GaAs  HEMTs.  The  Hall  mobilities  of  Si  delta  dop¬ 
ing  in  GaAs  are  enhanced  compared  to  that  of  equivalent 
uniform  doping,  due  to  strong  screening  of  Coulombic  po¬ 
tential  induced  by  the  ionized  Si  dopants.  Moreover,  the  low- 
temperature  Hall  concentrations  of  delta-doped  GaAs  show 
dependence  on  the  applied  magnetic  field.*’  This  results 
from  the  fact  that  in  the  delta-doped  samples  there  are  sever¬ 
al  conduction  channels  with  different  carrier  mobilities,*’ 
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Fig,  2.  ( a )  a  PL  spectrum  ofa  M  DQW  from  which  the  Fermi  energy  position  can  be  extracted.  ( b )  77-K  PL  linewidthsas  a  function  of  sheet  carrier  density 
measured  by  Hall  measurements,  showing  a  good  linear  dependence,  (c)  ratio  of  linewidths  at  77  and  4.2  K  as  a  function  of  77  K  Hall  mobility  (Ref  27). 


O  CAV.UIOM 


Fig.  3  Schematic  diagram  of  Si  della  doping  in  GaAs,  where  the  Si  dopants 
are  ideally  deposited  on  a  single  plane  during  the  epitaxial  growth  interrup¬ 
tion. 


due  to  subbands  induced  by  the  dopant  plane.^  A  similar 
field-dependent  effect  of  Hall  concentrations  can  also  be 
found  in  AIGaAs/GaAs  HEMTs/^  where  the  carriers  in  the 
AlGaAs  layer  are  not  fully  depleted  and  thus  form  another 
conduction  channel.  The  Hall  concentration  is  therefore  a 
weighted  sum  of  the  carrier  density  of  each  subband,  as  veri¬ 
fied  by  temperature-dependent  Hall  measurements.'^ 

To  investigate  the  subband  system  associated  with  the  del¬ 
ta-doping  layer,  Shubnikov-de  Haas  measurements  can  be 
used  to  extract  the  carrier  concentration  of  each  subband 
from  the  oscillation  period  of  magnetoresistance  versus 
magnetic  field.'*^  The  SdH  measurement  requires  high  mag¬ 
netic  field,  of  the  order  of  tesla  ( 10^  G),  and  liquid  helium 
temperature  operation.  Figure  4  shows  t3rpica]  MH  oscilla¬ 
tions  of  longitudinal  magnetoresistance  versus  inverse  mag¬ 
netic  field  for  delta-doping  structures.  The  data  indicate  the 
existence  of  the  2DEG  and  separate  subbands. Each 
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(-doped  GaAs(Si) 


Fig.  4,  SdH  measurement  of  Si  delta  doping  in  GaAs,  showing  the  process¬ 
ing  of  the  transverse  magnetoresistance  to  a  FFT  spectrum,  indicating 
the  location  of  each  subband  and  its  carrier  concentration  { Ref.  44). 


group  of  oscillations  corresponds  to  one  occupied  subband 
and  subband  carrier  populations  can  be  calculated  from  the 
oscillation  periods.  Fast  Fourier  transform  (FFT)  tech¬ 
niques  have  been  employed  to  expedite  the  calculation.  The 
subband  mobilities  of  the  2DEG  can  also  be  estimated 
through  the  SdH  measurement."’ 


vocTioe  vivi 


Fig.  5.  (a)  The  measured  C-Fcurvc,  (b)  the  C-F’profile  depth,  and  (c)  the 
C-F profile  with  a  FWHM  of  40  A,  for  Si  delta-doped  GaAs  (Ref.  53). 

the  electrons  are  described  by  wave  functions  corresponding 
to  subbands,  rather  than  by  dopant  distribution  functions.’^ 
The  C-F  profiling  thus  shows  the  expectation  values 
through  the  conventional  formulation.’^  Theoretical  calcu¬ 
lations  demonstrate  that  the  resolution  of  the  C-F  profiling 
in  the  delta-doping  structures  is  the  spatial  extent  of  the 
ground  state  wave  function,’"  typically  of  the  order  of  lattice 
constants.  For  Si  delta  doping  in  GaAs,  a  typical  C-F  curve 
and  the  C-  F  depth  are  illustrated  in  Fig.  5.  The  resulting  C- 
F carrier  profile,  also  shown  in  Fig.  5,  exhibits  a  FWHM  of 
40  A,  much  less  than  the  Debye  length.  The  profile  width 
strongly  depends  on  the  dopant  diffusion  and  therefore  the 
substrate  temperature  during  growth.”  The  C-F  profiles  of 
Be  delta  doping  in  GaAs  at  low  growth  temperature 
<  550  ’C  are  even  narrower  than  that  of  Si  delta  doping,  as 
illustrated  in  Fig.  6.  The  narrowest  C-F  profile  FWHM  of 
12  A  yet  reported  was  obtained  in  delta-doped  quantum 


V.  CARRIER  AND  DOPANT  PROFILES 

The  actual  distributions  of  dopants  and  carriers  along  the 
growth  direction  in  delta-doping  structures  should  be  mea¬ 
sured  to  understand  dopant  incorporation  during  epitaxial 
growth  and  carrier  localization  mechanisms.  Capacitance- 
voltage  (C-F)  measurement  is  a  well-accepted  electrical 
profiling  technique  for  doping  in  semiconductors."”  The  C- 
F  profiling  method  measures  the  free-carrier  profile  in  a 
Schottky,/>^n,  or  n^p  diode  structure,  based  on  the  deple¬ 
tion  approximation.  The  carrier  profile  is  a  good  approxima¬ 
tion  to  the  actual  doping  profile  for  homogeneous  or  slowly 
varying  doping.  The  resolution  of  C-F  profiling  is  of  the 
order  of  Debye  length,’””  typically  >  50  A.  This  has  in  the 
past  led  to  a  conclusion  that  carrier  profiles  any  more  abrupt 
than  a  Debye  length  cannot  be  detected  by  the  C-  F measure¬ 
ment.  However,  this  conclusion  does  not  hold  in  cases  for 
which  extremely  sharp  variation  such  as  delta  doping  oc¬ 
curs,  because  of  quantum  size  effects.  In  delta-doping  layers. 


OEPTM  I  (t) 
{•) 


(b) 


Fig.  6.  (a)  The  C-F  profile  with  FWHM  of  20  A  and  (b)  the  SIMS  profile 
of  the  Be  della  doping  in  GaAs  grown  at  500 ’C  by  MBE  (Ref  8). 
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Fio.  7.  The  narrowest  C-  V profile  with  FWHM  of  12  A,  as  reported  in  Ref. 
56  in  a  delta-doped  GaAs/InGaAs  quantum  well. 


FlO  8.  The  C-Fprofilesofdeltadoping  in  (a)  bulk  GaAs  (b)  I50AQW. 
(c)  100  A  QW,  and  (d)  50  A  QW  samples.  All  samples  were  grown  at 
600  *C  b)  MBE  with  identical  thermal  treatment  after  the  delta-doping 
layers  were  deposited,  yielding  identical  dopant  distributions. 


Fio.  9.  Coir  risen  of  FWHM  of  C-  V profiles  based  on  theories  and  experi¬ 
ments.  indicating  that  the  dopants  are  in  fact  confined  in  a  monolayer  plane 
(Ref  53).  The  dz  represents  the  FWHM  of  the  dopant  distribution. 


wells,’*’  as  shown  in  Fig.  7.  The  additional  confinement  of 
the  quantum  wells  gives  rise  to  much  narrower  carrier  pro¬ 
files  than  that  of  delta  doping  in  bulk  materials.  Our  studies 
of  delta-doped  quantum  wells  have  demonstrated  a  signifi¬ 
cant  narrowing  effect  of  the  C-V  profiles.”  Figure  8  shows 
that  the  FWHM  of  the  C-V  profiles  decreases  as  the  quan¬ 
tum  well  width  decreases.’^ 

The  actual  dopant  distributions  of  the  delta  doping  are 
important  information  to  determine  the  dopant  diffusion 
during  growth  or  annealing.’"  Secondary  ion  mass  spectros¬ 
copy  (SIMS)  has  been  used  to  study  the  dopant  diffusion’"* 
and  verify  that  the  dopant  distributions  of  the  delta  doping 
are  close  to  a  delta  function,  as  shown  in  Fig.  6(b).  The 
resolution  of  the  SIMS  technique  is  limited  by  roughening  of 
the  surface  and  the  “knock-on-cffect,”  which  tend  to  in¬ 
crease  the  uncertainty  and  width  of  the  dopant  profile."  This 
can  be  seen  in  Fig.  6,  in  which  the  C-  F profile  shows  narrow¬ 
er  width  than  the  SIMS  profile.  Theoretical  calculations  ob¬ 
tained  by  fitting  dopant  distributions  into  the  C-V  carrier 
profiles,  as  illustrated  in  Fig.  9,  reveal  that  the  dopants  are 
spatially  confined  within  a  few  lattice  constants.  However, 
the  presence  of  DX  centers  for  heavily  Si  doped  GaAs  may  in 
fact  cause  a  narrowing  effect  on  the  C-Fcarrier  profile,  indi¬ 
cating  that  the  actual  dopant  distributions  may  be  wider 
than  previously  assumed.*” 

In  conclusion,  the  measurements  of  the  abrupt  transitions 
in  III-V  materials  require  high  resolution  measuring  tech- 
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nique  as  well  as  theoretical  calculations  involving  quantum 
mechanics  and  statistical  mechanics.  The  PL  has  been  able 
to  measure  the  abruptness  of  intrinsic  or  lightly  doped  heter¬ 
ostructures  such  as  quantum  wells,  and  theelectrical  proper¬ 
ties  of  doped  structures  such  as  MDQWs.  To  measure 
abrupt  doping  profiles  such  as  delta  doping,  it  is  essential  to 
employ  measuring  tools  with  resolutions  of  the  order  of  lat¬ 
tice  constants.  Full  investigation  of  the  delta-doping  layers, 
however,  needs  the  combining  of  experiments  and  calcula¬ 
tions  and  is  a  very  attractive  topic  in  physics  and  device  ap¬ 
plications. 
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The  high-resistivity  properties  of  low-temperature  (200-300*0  molecular-beam-epitaxy-grown 
GaAs  and  AlGaAs  have  great  potential  for  device  applications.  The  purpose  of  this  study  is  to 
examine  the  effect  of  low-temperature  buffers  (LTB)  on  the  crystal  quality  of  epilayers  grown 
on  top.  Photoiuminescence  (PL)  from  quantum  wells  grown  at  regular  growth  temperatures 
(620  *C)  is  found  to  be  sensitive  to  changes  in  crystal  quality  introduced  by  the  LTB.  Limiting 
LT  AlGaAs  buffer  thicknesses  to  a  critical  thickness  is  found  necessary  for  growing  layers  with 
good  quantum-well  PL. 


I.  INTRODUCTION 

Molecular-beam-epitaxy  (MBE)  grown  low-temperature 
(LT)  GaAs  has  been  found  to  exhibit  very  interesting 
properties.  Annealed  LT  GaAs  exhibits  high  resistivity  and 
has  been  used  as  buffer  layers  to  reduce  sidegating  and 
backgating  of  metal-semiconductor  field-effect  transistors 
(MESFETs)  and  high-electron-mobility  transistors. 
Other  applications  include  use  of  the  LT  layer  as  an  isola¬ 
tion  buffer  layer  for  solar  cells.'*  The  LT  GaAs  has  also 
been  used  as  the  photoconductor  material  for  TH?  pulses.* 
Low-temperature  buffers  appear  to  reduce  threading  dislo¬ 
cations  in  lattice-mismatched  devices.* 

These  layers  have  been  found  to  contain  about  1-2 
at.  %  excess  As  which  forms  precipitates  on  annealing  the 
layers  to  about  6(X)  The  high-resistivity  properties 
have  been  attributed  to  the  presence  of  these  precipitates  in 
both  AlGaAs  and  GaAs.  The  addition  of  aluminum  ap¬ 
pears  to  increase  the  resistivity  by  an  order  of  magnitude 
and  makes  LT  AlGaAs  preferable  to  LT  GaAs  as  an  iso¬ 
lation  buffer. 

These  LT  layers  can  be  used  as  “active’’  layers  or  as 
isolation  layers  only  if  the  epitaxial  films  grown  on  top  are 
smooth  and  free  from  defects  induced  by  the  underlying 
LT  layer.**  Photoiuminescence  is  sensitive  to  changes  in 
quantum-well  interfaces  and  material  quality.'*’**  The 
rough  interface  or  defects  migrating  to  the  quantum-well 
interface  from  the  LTB  would  influence  the  PL. 


II.  EXPERIMENTAL  STUDIES 

Semi-insulating  (100)  GaAs  wafers  were  used  in  this 
study.  The  wafers  were  first  cleaned  in  de-ionized  (DI) 
water  for  2  min  and  blown  dry  in  dry  nitrogen.  They  were 
then  mounted  to  molybdenum  blocks  using  indium  solder. 
After  the  cleaning  and  mounting  process,  the  blocks  were 
loaded  into  a  Varian  Gen  II  MBE  system  and  outgassed  at 
450  *C  for  1  h.  The  oxide  was  desorbed  by  heating  the 
sample  under  an  As  overpressure  at  670  *C.  The  layers 
were  grown  at  an  As/Ga  incorporation  ratio  of  1.8  and  a 
growth  temperature  of  620  *C.  The  As/Ga  BEP  ratio  was 
about  15.  The  arsenic  was  obtained  from  a  cracker 
source.  '* 


All  samples  incorporated  four  quantum  wells  grown  on 
a  LT  buffer.  The  thickness  of  the  LT  buffer  layer  was  var¬ 
ied  from  0  to  1.5  /xm  in  different  samples.  LT  GaAs  buffers 
were  grown  at  230  and  300  *C,  and  LT  AlojGao  7AS  buffers 
were  grown  at  230  and  270  *C.  These  buffer  layers  were 
grown  on  top  of  a  0.5  /xm  smoothing  layer  of  GaAs  and 
were  annealed  for  10  min  at  620  °C  under  an  AS2  overpres¬ 
sure.  Then  four  GaAs  quantum  wells  (35,  19,  11,  and  5 
ML)  were  grown  with  200  A  AlosGao  jAs  barriers  at 
620  *C  on  top  of  the  LTB.  A  four-quantum-well  reference 
sample  without  any  LT  buffer  was  also  grown  for  compar¬ 
ison  studies. 

Photoiuminescence  (PL)  measurements  were  done  on 
the  quantum  wells  at  4.2  K.  The  514.5  nm  line  from  an 
argon-ion  laser  was  used  as  the  excitation  source  at  power 
densities  of  100  mW/cm^.  A  typical  spectrum  for  the  ref¬ 
erence  (no  LT  buffer)  sample  is  shown  in  Fig.  1. 

III.  RESULTS  AND  DISCUSSION 

We  observe  a  degradation  in  the  PL  of  quantum  wells 
on  top  of  LT  AlGaAs  grown  at  either  230  or  270  "C.  In 
fact,  PL  from  the  quantum  well  (35  ML)  nearest  to  the 
LTB  was  not  detectable.  The  spectral  linewidth  of  the  5 
and  11  ML  wells  increased  as  the  thickness  of  the  LT 
AlGaAs  grown  at  230  *C  increased  above  0.75  ^m,  as 
shown  in  Fig.  2.  The  peak  luminescence  of  all  the  quantum 
wells  with  LTB  thicknesses  greater  than  0.75  ^m  was  red 
shifted  (Fig.  3).  The  onset  of  this  shift  in  the  peak  energy, 
together  with  a  change  in  linewidths  and  a  decrease  in  the 
integrated  PL  intensities,  occurred  at  the  same  LTB  thick¬ 
ness  of  0.75  /xm.  LT  AlGaAs  grown  at  a  higher  tempera¬ 
ture  of  270  *C  showed  a  smaller  red  shift  of  peak  PL  at 
about  1.5  /xm  (Fig.  4).  In  this  case,  the  linewidth  and  the 
integrated  PL  intensity  did  not  change  significantly  as  the 
LTB  thickness  increased.  The  greater  degradation  of  PL 
for  lower  growth  temperatures  may  be  due  to  greater  in¬ 
corporation  of  As  in  the  lattice. 

The  migration  of  cxcitons  to  thicker  wells  would  explain 
the  red  shifting  of  the  peaks.  This  thermalization  would 
only  occur  if  the  growth  island  size  is  smaller  than  the 
exciton  diffusion  length  (about  1.3-1. 8  nm).  Quantum 
wells  with  a  ragged  interface  would  experience  different 
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Fig.  t.  Typical  spectrum  for  the  rour-<)uantum-wetl  structure  without  a 
LT  buffer  layer. 

confinements  depending  on  the  position  of  the  exciton  in 
the  well.  The  PL  from  such  structures  would  result  in  a 
relatively  broad  peak  due  to  transitions  at  different  ener¬ 
gies. 

LT  GaAs  grown  at  230  *C  under  the  same  growth  con¬ 
ditions  did  not  result  in  any  degradation  in  quantum-well 
PL  linewidth  or  the  red  shift  in  peaks.  All  four  quantum- 
well  peaks  appeared  in  the  spectrum.  The  integrated  PL 
intensity  remained  approximately  constant  for  all  the  LT 
GaAs  buffers.  LT  GaAs  grown  at  300  *C  did  not  show  any 
degradation  for  the  buffer  thicknesses  (1.5  /.cm)  studied. 

After  the  10  min  anneal,  the  reflection  high-energy  dif¬ 
fraction  (RHEED)  (2X4)  surface  reconstruction  was 
found  to  recover  for  both  LT  GaAs  and  LT  AIGaAs. 
From  qualitative  studies  of  the  RHEED  patterns,  the  LT 
AIGaAs  layers  did  not  smooth  out  during  the  annealing 
period  to  the  same  extent  as  the  LT  GaAs  layers.  This  is 
consistent  with  the  complete  disappearance  of  the  first 
quantum-well  PL  in  the  LT  AIGaAs  layers.  The  bond  en¬ 
ergy  for  Al-As  is  greater  than  Ga-As,  which  contributes 
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FtO.  2.  Influence  of  LT  A!G*As  liyer  thickness  on  FWHM  of  PL  for  two 
quantum-well  widths. 
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Fig.  T.  Change  in  peak  PL  energy  of  quantum  wells  with  the  tfaickneia  of 
LT  AlCaAs  grown  at  230  *C.  The  peak  energy  shifts  to  lower  eoesgies  as 
the  eaciion  themalizes  to  thicker  welts. 


to  increased  surface  roughness  and  slower  recovery  of 
RHEED  intensities. 


iV.  CONCLUSION 

The  experiments  found  a  degradation  of  PL  for  quan¬ 
tum  wells  grown  on  top  of  LT  buffers  which  had  been 
grown  at  different  temperatures  in  the  range  of  230-300  'C. 
The  study  of  LTB  material  included  both  GaAs  and  AI¬ 
GaAs.  We  conclude  that  surface  roughness  from  the  LT 
GaAs  is  not  a  problem  if  the  buffers  are  annealed  for  times 
long  enough  for  the  RHEED  intensities  to  recover.  There 
is  a  thickness  limit  on  the  LT  AIGaAs  layers  that  are 
grown  under  similar  growth  conditions.  This  limit  appears 
to  be  related  to  the  intrinsic  roughness  introduced  by  Al 
and  the  amount  of  excess  arsenic  that  is  incorporated  in  the 
LT  AIGaAs  buffer.  A  higher  temperature  for  growth  of 
these  LT  layers  appears  to  reduce  the  associated  surface 
roughness.  The  reduced  surface  mobility  of  Al  and  Ga  at 
these  low  temperatures  could  have  a  role  in  thq  contribu¬ 
tion  to  surface  roughness. 
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Flo.  4.  Change  in  peak  PL  energy  of  quantum  wells  with  the  thickness  of 
AIGaAs  LT  buffer  grown  al  270  *€. 
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We  report  the  eff«:ts  of  changing  AS4/AS2  flux  ratio  from  an  As  cracking  source  on  Si  and  Be 
5-doped  GaAs  grown  by  molecular-beam  epitaxy.  It  is  observed  that  the  carrier  concentration 
increases  as  the  AS4/AS2  flux  ratio  increases.  The  spatial  confinement  of  carriers  in  the  6-doped 
induced  potential  well  is  also  enhanced  using  high  AS4/AS2  flux  ratio.  These  effects  are 
attributed  to  the  enhancement  of  dopant  incorporation  by  As^  during  the  6-doping  growth 
period. 


I.  INTRODUCTION 

The  technique  of  8  doping  in  semiconductors  has  moti¬ 
vated  a  large  amount  of  research  on  device  applications 
and  material  studies.  By  introducing  a  single  plane  of  dop¬ 
ants  into  semiconductors,  a  variety  of  novel  devices  have 
been  suggested.'  In  practice,  the  8  doping  is  accomplished 
by  depositing  dopant  atoms  onto  a  growth-interrupted 
semiconductor  surface  using  epitaxial  growth  techniques, 
such  as  molecular-beam  epitaxy  (MBE)  or  metalorganic 
chemical  vapor  deposition  (MOCVD).  Abrupt  doping 
profiles  with  widths  on  the  order  of  few  lattice  constants 
have  been  confirmed  by  various  characterizing  techniques 
and  theoretical  calculations.^  For  example,  an  extremely 
narrow  capacitance-voltage  (C-F)  profile  with  a  full 
width  at  half-maxima  (FWHM)  of  5  A  has  been  reported 
in  6-doped  quantum  wells.'  The  abrupt  dopant  distribu¬ 
tions  in  6-doped  structures  give  rise  to  many  interesting 
phenomena,  such  as  quantum-size  effect  from  the  dopant- 
induced  potential  well,  enhanced  carrier  mobilities,  and 
enhanced  carrier  concentrations.'* 

MBE  is  widely  employed  to  synthesize  8  doping  in  semi¬ 
conductors,  such  as  GaAs  and  Si.  In  6-doped  GaAs,  the 
Ga  flux  is  normally  turned  off  to  interrupt  the  GaAs 
growth  and  the  dopant  shutter  is  opened,  while  the  As 
shutter  is  kept  open  to  maintain  surface  stoichiometry.  In 
other  words,  the  growth-interrupted  surface  is  exposed  to 
both  As  and  dopant  fluxes  during  the  8  doping  growth 
period.  Si  and  Be  are  used  respectively  as  donor  and  ac¬ 
ceptor  species  in  6-doped  GaAs.  Currently,  MBE  systems 
use  tetrameric  (AS4)  and/or  dimeric  (AS2)  arsenic 
sources.  In  practice,  the  As-  cell  consists  of  an  AS4  crack¬ 
ing  section  and  a  sublimation  section.’  The  As  flux  from 
the  A$2  cracking  source  in  fact  contains  a  fraction  of  un¬ 
cracked  AS4,  and  by  adjusting  the  cracking  temperature 
the  fraction  of  AS2  in  the  As  flux  can  be  varied.  The  growth 
mechanisms  of  GaAs  with  AS2  and  AS4  are  different,  being 
first-order*  and  second-order  processes,’  respectively.  The 
A$2  source  has  some  advantages  over  the  AS4  source  in 
crystal  growth;’  however,  for  homogeneous  Si  doping  in 
GaAs,  it  has  been  shown  recently  that  the  use  of  a  high 
AS4/AS2  flux  ratio  from  an  AS2  cracking  source  leads  to 


higher  electron  concentrations.*  No  similar  study  has  been 
done  on  6-doped  GaAs  prior  to  the  present  work. 

We  report  the  effects  of  the  AS4/AS2  flux  ratio  on  Si  and 
Be  S-doped  GaAs.  Hall  measurements  are  employed  to 
measure  the  free-carrier  concentrations,  and  the  spatial  ex¬ 
tents  of  the  carrier  distributions  arc  characterized  by  C-F 
measurements. 

H.  EXPERIMENTS 

Samples  were  grown  in  a  Varian  Gen  II  MBE  system 
with  a  structure  consisting  of  a  5000  A  undc^jd  GaAs 
smoothing  layer  grown  at  600 ’C  and  a  6-doping  layer 
sandwiched  between  two  1000-A-thick  GaAs  layers  grown 
at  600  or  550  *C.  During  the  6-doping  growth  period,  the 
Si  (Be)  shutter  was  open  for  30  ( 10)  s.  The  GaAs  growth 
rate  was  1.0  ML/s.  The  AS4/AS2  flux  ratio  was  adjusted  by 
changing  the  cracker  furnace  temperature,  as  measured  by 
a  pyrometer.  The  As-to-Ga  incorporation  ratio  is  kept  con¬ 
stant  at  1.7  by  adjusting  the  sublimator  power.  Figure  1 
shows  the  cracker  furnace  temperature  and  the  As  beam 
equivalent  pressure  (BEP)  as  functions  of  the  voltage  of 
the  cracker  power  supply.  As  shown  in  Fig.  1,  the  cracker 
furnace  temperature  is  roughly  proportional  to  the  voltage 
applied  to  the  cracker  heating  element;  therefore,  subse¬ 
quent  discussions  and  figures  refer  to  the  cracker  voltage  as 
an  indicator  of  temperature.  Since  AS4  has  a  lower  sticking 
coefficient  than  As2,  a  high  As  BEP  would  imply  a  high 
A84/AS2  flux  ratio  for  constant  As-to-Ga  incorporation  ra¬ 
tio,  which  corresponds  to  low  cracker  voltages.  Residual 
gas  analysis  (RGA)  also  confirmed  that  the  relative 
amount  of  A54  in  the  As  flux  increases  as  the  cracker  volt¬ 
age  decreases.  Hall  measurements  were  performed  at  room 
temperature  and  no  significant  carrier  freezeout  was  ob¬ 
served  at  low  temperature  (77  K).  An  HP4194  Gain- 
phase/impedance  analyzer  was  employed  to  measure  C-F 
profiles  at  frequencies  ranging  from  100  kHz  to  1  MHz. 

III.  RESULTS  AND  DISCUSSION 

Figure  2  shows  the  two-dimensional  free-carrier  concen¬ 
tration  as  a  function  of  the  cracker  voltage  at  substrate 
temperatures  of  550  and  600  *C  for  Si  and  Be  6  doping. 
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Fig.  I  The  As  BEP  and  me  cracker  furnace  temperaiure  as  funcoora  of 
the  cracker  voltage 

respectivcJy.  No  significant  dependence  of  the  earner  con¬ 
centration  on  the  substrate  temperature  is  observed  It  can 
be  seen  that  as  the  cracker  voltage  increases  ( As*/As>  flux 
ratio  decreases)  the  earner  concentration  decreases  by  as 
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Fig.  2,  The  Hall  concentrations  of  (a)  St  and  (b)  Be  6-doped  GaAs  at 
substrate  temperatures  of  550  and  600  *C. 
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Fl<r  1  The  f  WHMs  of  the  C  -f'  ptoliles  of  Ss  A-doped  GaAs  grown  »t 
nOO  and  55<i  X 


much  as  a  factor  of  2  for  both  types  of  doping  The  Hall 
mobilities  arc  nearly  constani  at  about  2300  and  100 
cm*/V  s  in  Si  and  Be  A-doped  samples,  respectively,  for  all 
cracker  voltages  These  observations  suggest  that  dunng 
the  A-doping  growth  period  As^  molecules  may  have  the 
following  effects  versus  As,  molecules  1 1 )  higher  incorpo¬ 
ration  of  the  dopant  atoms  onto  ihe  surface  or  ( 2 )  a  higher 
electncai  activation  or  less  autocompensation  of  the  dop¬ 
ants  after  the  deposition  of  the  A-doping  layer  In  Be  A- 
doped  samples,  it  is  unlikely  that  the  electrical  activation  of 
Be  atoms  depends  on  the  species  of  As  after  the  A-doping 
growth  penod.  Previous  work*  also  indicates  that  auto- 
compensation  of  Si  in  GaAs  is  a  negligible  effect.  There- 


Cracker  Voluge  (V) 


Fig.  4.  The  FW  HMs  of  the  C-F  profiles  of  Be  A-doped  GaAs  with 
growth  temperaiures  of  600  and  550  "C 
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fore,  the  first  possibility  is  the  most  probable  explanation  of 
the  experimental  results  Since  As^  is  more  reactive  at  the 
surface  than  As^,  it  may  be  that  the  formation  of  volatile 
molecules  involving  Asi  and  the  dopants  can  cause  the 
reduction  of  dopant  incorporation  in  the  crystal. 

C~y  measurements  confirm  that  the  dopant  incorpora¬ 
tion  IS  higher  for  high  AS4/AS2  fiux  ratio.  As  shown  in  Fig. 
3,  the  FWHM  of  the  C-F  profiles  of  Si  6-doped  GaAs 
increases  as  the  cracker  voltage  decreases.  This  might  be 
expected  from  the  higher  incorporation  of  dopants  as  the 
AS4  in  the  As  flux  increases.  As  a  result,  the  confinement  of 
carriers  in  the  dopant-induced  potential  well  is  enhanced 
due  to  a  larger  number  of  ionized  dopants  and  therefore 
stronger  Coulombic  attractive  interactions.  At  low  growth 
temperatures,  the  C-V  profiles  show  less  dependence  on 
the  cracker  voltage  because  there  is  less  As  reevaporation. 
Similarly,  in  Be  d-doped  GaAs,  as  shown  in  Fig.  4,  the 
FWHM  of  the  C-V  profile  decreases  as  the  cracker  voltage 
decreases.  At  low  cracker  voltages  where  the  resulting  dop¬ 
ant  concentration  is  higher,  however,  the  width  of  the  C-y 
profile  increases.  This  results  from  the  fact  that  at  very 
high  Be  concentration,  the  repulsive  Coulombic  interac¬ 
tions  between  the  ionized  Be  acceptors  become  significant.'* 
As  a  result,  the  dopant  distributions  and  thus  the  carrier 


distributions  are  broadened  when  the  spacing  between  Be 
acceptors  reaches  a  minimum. 
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Scanning  tunneling  microscopy  and  spectroscopy  is  used  to  study  GaAs  multiple  pn  junction 
samples  cleaved  in  ultrahigh  vacuum.  Direct  topographic  contrast  over  the  pn  junctions  can  be 
observed  in  the  constant  current  imaging  mode.  The  topographic  height  in  the  p-type  regions 
appears  much  lower  (by  about  5  A)  than  that  in  the  /i-type  regions.  Tunneling  spectroscopy 
measurements  show  consistency  with  the  assignment  of  the  p-  and  n-type  regions.  We  discuss  a 
possible  mechanism  for  the  observed  contrast. 


Recently,  there  have  been  numerous  applications  of 
scanning  tunneling  microscopy  (STM)  to  study  semicon¬ 
ductor  junctions  including  homo-  and  heterojunctions. 
The  most  appealing  feature  of  this  approach  is  the  ability 
to  simultaneously  investigate  both  the  structural  and  elec¬ 
tronic  properties  at  atomic  or  nanometer  resolution.  The 
understanding  of  these  properties  is  very  important  for  the 
development  of  ultrasmall  devices  for  modern  electronics. 

Because  STM  is  a  surface  sensitive  technique,  one 
needs  to  create  a  fresh  surface  which  exposes  the  junction 
regions.  It  is  generally  agreed  that  the  best  way  to  do  that 
is  to  cleave  samples  under  ultrahigh  vacuum  (UHV).  The 
tunneling  tip  is  then  brought  into  the  junction  region  for 
STM  studies.  Since  the  device  junction  areas  are  usually 
less  than  one  micron  across,  the  major  technical  difficulty 
is  the  precise  positioning  of  the  tip  in  the  junction  region. 
Many  groups  have  therefore  combined  STM  with  a  scan¬ 
ning  electron  microscope  (SEM)''^  to  help  position  the  tip 
in  the  junction  region,  with  most  of  the  studies  concentrat¬ 
ing  on  the  heterojunction  systems.  In  fact,  it  was  only  after 
an  UHV-SEM/STM  combination  had  been  developed  that 
the  first  atomically  resolved  images  of  semiconductor  het¬ 
erojunctions  (AlGaAs/GaAs  system)  was  achieved.* 

As  for  the  semiconductor  homojunctions,  most  of  the 
studies  have  concentrated  on  Si  pn  junctions,^’  although 
very  few  were  studied  with  an  UHV  environment.  Further¬ 
more,  most  of  those  studies  relied  on  biasing  the  p-  and 
n-type  regions  differently  in  order  to  observe  any  contrast 
between  the  junction  regions.  On  the  other  hand,  STM 
studies  of  the  GaAs  pn  junctions  have  never  been  explored 
except  for  a  study  using  scanning  tunneling  potentiome- 
try.* 

STM  studies  of  GaAs  pn  junctions  have  a  major  ad¬ 
vantage  over  those  of  Si  pn  junctions.  Since  the  intrinsic 
surface  states  of  the  cleaved  GaAs  (110)  surface  are  out¬ 
side  the  band  gap,  tunneling  spectroscopy  measures  a  sur¬ 
face  band  gap  which  is  the  same  as  the  bulk  band  gap.^ 
Furthermore,  if  there  are  no  extrinsic  surface  states  to  pin 
the  surface  Fermi  level,  then  the  position  of  the  Fermi  level 
relative  to  the  band  edge  measured  by  the  tunneling  spec¬ 
troscopy  is  the  same  as  that  in  the  buik.^  Thus,  the  idea  of 
mapping  out  the  band  profile  across  the  pn  junction  using 
scanning  tunneling  spectroscopy  is  possible  in  the  cleaved 
( 1 10)  surface  of  GaAs.  This  is  certainly  not  the  case  for  Si. 


In  this  letter,  we  report  scanning  tunneling  microscopy 
and  spectroscopy  (STM/S)  studies  of  GaAs  multiple  pn 
junctions.  To  our  knowledge,  this  is  the  first  STM/S  study 
of  GaAs  pn  junctions.  Our  experiments  were  performed  in 
a  UHV-STM  system  equipped  with  a  unique  dual-axis 
sample  translation  stage  (the  design  and  the  characteristic 
of  which  will  be  deb'ribed  elsewhere).  GaAs  multiple  pn 
junctions  were  grown  using  molecular  beam  epitaxy 
(M6E)  with  varying  doping  densities  and  layer  thick¬ 
nesses.  Si  and  Be  were  used  as  dopants  and  the  layers  were 
grown  at  600  *C  under  As-stablc  conditions.  Samples  were 
cleaved  in  UHV  with  a  base  pressure  of  5  X  10"”  Torr  or 
less  to  create  ( 1 10)  surfaces,  thus  exposing  the  junction 
area.  The  tunneling  tip  was  first  brought  to  the  sample 
within  20  pm  from  the  edge  by  using  an  optical  micro¬ 
scope.  Instead  of  using  a  UHV-SEM/STM  combination, 
we  employed  a  different  edge  finding  method  which  enables 
us  to  locate  the  tip  over  the  junction  regions  very  reliably 
The  details  of  this  method  will  be  described  elsewhere. 

Figure  1  shows  a  small  area  of  a  constant  current  im¬ 
age  taken  at  the  n'^  substrate  right  next  to  the  pn  junction 
regions.  The  image  was  obtained  at  a  tip  bias  of  -f  3  V 
relative  to  the  sample,  thus  corresponding  to  that  of  the  As 
sublattice.’  The  atomically  resolved  image  is  used  to  cali¬ 
brate  the  piezoelectric  scanner  and  to  characterize  the  tip 
before  the  large  area  scan  over  the  junction  is  performed. 
Figure  2(a)  shows  the  grown  device  structure  where  the 
area  enclosed  by  the  dashed  line  corresponds  to  the 
scanned  area  shown  in  Fig.  2(b).  This  three-dimensional 
(3D)  perspective-view  image  was  obtained  at  tip  bias  of 
-I-  2.9  V,  thus  corresponding  to  the  image  of  the  filled  states 
of  the  sample.  As  one  can  clearly  see  in  this  perspective- 
view  image,  there  is  a  large  topographic  height  difference 
between  the  n-type  and  p-type  regions  in  this  constant  cur¬ 
rent  image.  The  topographic  height  in  the  p-type  region 
appears  lower  than  that  in  the  n-type  region.  It  also  shows 
a  dependence  on  either  the  device  width  or  the  doping 
level,  namely,  the  shorter  device  region  (or  higher  doping 
level)  shows  a  higher  contrast.  The  largest  topographic 
height  difference  is  about  S  A.  STM  images  obtained  at  the 
opposite  bias  (  —  2.9  V)  also  show  similar  contrast, 
namely,  the  topographic  height  in  n-type  regions  being 
higher  than  that  in  the  p-type  regions. 

The  assignment  of  the  p-  and  n-type  regions  was  first 


1 104  Appl.  Phys.  Lett.  61  (9).  31  August  1992  0003-6951/92/341 104-03$03.00 


®  1992  American  (nsfrtute  of  Physics  1 104 


FIG.  1.  Top-view  STM  image  acquired  at  the  substrate  right  next  to 
the  pn-junction  regions.  The  image  is  obtained  at  a  positive  tip  bias  of  3  V. 
thus  corresponding  to  the  As  sublattice.  The  scanning  area  is  100  by 

100  A. 

based  on  a  comparison  of  the  length  scale  shown  in  the 
image  with  that  of  the  grown  device  structure.  In  addition, 
tunneling  spectroscopy  was  used  to  verify  our  assignment. 
Two  typical  tunneling  spectra  are  shown  in  Fig.  2(c) 
which  were  acquired  at  the  positions  shown  on  the  image 
2(b)  labeled  as  spots  1  and  2.  The  voltage  scale  on  the  I-V 
spectra  represents  the  voltage  of  the  tip  relative  to  the 
sample;  thus,  positive  voltages  correspond  to  filled  states  of 
the  sample.  The  positions  of  the  band  edges  (also  marked 
on  the  f-V  spectra)  are  determined  as  the  positions  where 
there  is  no  detectable  tunneling  current.  The  error  bar  is 
estimated  to  be  ±0.1  eV.  As  one  can  see,  the  spectra  show 
that  the  Fermi  level  is  closer  to  the  valence  band  edge  in 
the  p-type  region,  whereas  it  is  closer  to  the  conduction 
band  edge  in  the  n-type  region.  However,  our  spectra  show 
a  slightly  larger  band  gap  than  the  bulk  band  gap  of  1 .4  eV. 
The  measured  gap  values  are  1.6±0.2  and  1.8±0.2  eV  in 
the  n-  and  p-type  regions,  respectively.  This  may  be  due  to 
the  effect  of  the  tip-induced  band  bending  in  the  measure¬ 
ment.  Furthermore,  the  shift  in  the  spectra  acquired  at  the 
n-type  region  relative  to  that  acquired  at  the  p-type  region 
is  only  about  0.7  ±0.2  eV  when  one  uses  the  assignment  of 
the  conduction  band  edges.  This  shift  is  smaller  than  the 
built-in  potential.  Nevertheless,  the  spectra  show  qualita¬ 
tive  agreement  with  our  assignment  of  the  p-  and  n-type 
regions. 

We  now  discuss  the  contrast  mechanism.  We  interpret 
our  image  by  considering  both  the  work  function  difference 
in  the  p-  and  n-type  regions  and  the  effect  of  the  tip  in¬ 
duced  band  bending.  Figure  3(a)  shows  energy  diagrams 
of  the  tunnel  junctions  at  the  p-  and  n-type  regions,  respec¬ 
tively.  It  should  be  noted  that  the  length  scale  in  the  tun¬ 
neling  region  (order  of  20  A)  and  the  length  scale  in  the 
band  bending  region  (order  of  a  few  hundred  angstroms) 
is  not  the  same.  The  work  function  at  the  p-type  region  is 
higher  than  that  at  the  n-type  region  by  about  1.4  eV.  With 
the  tip  biased  positively,  the  Fermi  level  of  the  tip  is  lower 
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FIG.  2.  (a)  The  device  structure  of  GaAs  multiple^fl  junctions  grown  by 
molecular  beam  epitaxy.  The  area  enclosed  by  dashed  line  corresponds  to 
the  STM  image  shown  in  (b)  (b)  A  3D  perspective-view  STM  image  of 
the  GaAs  multiple  pn  junctions  acquired  at  a  tip  bias  of  2.9  V.  The 
scanning  area  is  4000  by  4000  A.  (c)  Tunneling  spectra  obtained  on  the 
p-  and  n-type  regions.  The  spectra  were  taken  at  the  locations  labeled  as 
spots  I  and  2  in  the  image. 

than  that  of  the  sample.  This  will  cause  the  band  to  bend 
downward  resulting  in  a  highly  accumulated  layer  in  the 
/j-type  and  depicted  layer  in  the  p-type  region  (until  the 
inversion  occurs).  A  highly  accumulated  layer  in  the  n- 
type  regions  means  that  there  are  appreciable  occupied 
states  in  the  conduction  band.  Since  the  tunneling  occurs 
between  the  occupied  states  of  the  sample  and  the  unoccu¬ 
pied  states  of  the  tip,  the  highest  occupied  states  of  the 
sample  see  the  lowest  tunneling  barrier  ( shown  as  a  thick 
dashed  arrow  in  the  figure).  As  a  result,  the  effective  tun¬ 
neling  barrier  height  for  the  n-type  region  is  substantially 
smaller  than  that  for  the  p-type  region.  This  interpretation 
is  consistent  with  the  higher  topographic  heights  observed 
in  the  n-type  region. 

The  same  interpretation  applies  for  the  case  when  the 
tip  is  negatively  biased.  Figure  3(b)  shows  the  energy  di¬ 
agrams  for  this  case  where  the  depletion  occurs  in  the 
n-type  region  and  the  accumulation  occurs  in  the  p-type 
region.  Since  the  lowest  tunneling  barrier  height  occurs  at 
the  highest  occupied  states  of  the  tip  (shown  by  the  thick 

Gwoelal.  it  05 


FIG  3.  (a)  Energy  diagrams  of  the  tunnel  junctions  with  the  tip  posi¬ 
tively  biased  at  the  p-  and  n-type  regions.  The  tip-induced  band  bendings 
are  also  shown  It  should  be  noted  that  the  length  scale  in  the  tunneling 
region  (order  of  20  A)  and  the  length  scale  in  the  band  bending  region 
(order  of  a  few  hundred  angstroms)  is  not  the  same,  (b)  Same  energy 
diagrams  of  the  tunnel  Junctions  as  those  shown  in  (a)  except  that  the  tip 
negatively  biased. 

dashed  arrow  in  the  figure),  the  effective  tunneling  barrier 
height  is  still  smaller  in  the  /i-type  region.  Thus,  this  inter¬ 
pretation  is  also  consistent  with  the  same  contrast  observed 
in  our  study  when  the  tip  was  biased  negatively. 

Consider  that  l~exp(  —2x5),  where  k  is  related  to  the 
effective  barrier  height  <t>,  as 

The  observed  topographic  height  difference  can  then  be 
expressed,  to  first  order,  as 

A5s:(Ax/x)-5o , 

where  Sq  is  the  average  tip-to-sample  distance.  If  we  esti¬ 
mate  that  A<1>  is  about  1-1.5  eV  then  Ax  is  about  0.15-0.2 
A~'.  A  typical  x  is  1  A~'.  If  we  also  assume  that  Sq  is 
about  15-20  A,  then  A5  is  about  3-4  A  which  is  consistent 
with  our  experimental  value  of  5  A.  The  assumption  of  5o 
being  about  15-20  A  is  based  on  previous  measurements  of 
the  vacuum  barrier  resonance  such  as  those  used  by 
others**^'^  to  determine  the  absolute  tip-to-sample  dis¬ 
tance.  However,  we  do  not  rule  out  the  possibility  that 
there  could  exist  other  explanations  for  the  pn  contrast 
observed  in  our  experiments. 

Our  interpretation  of  the  contrast  mechanism  can  be 
extended  to  explain  the  lack  of  contrast  in  the  constant 
current  image  of  the  Si  pn  junctions  cleaved  in  UHV.’  The 
cleaved  Siflll)  surface  has  a  small  surface  gap  of  about 


0.2  eV,*'  thus  pinning  the  surface  Fermi  level  for  both  p- 
and  n-type  regions  to  within  0.2  eV.  The  lack  of  the  vari¬ 
ation  in  the  surface  Fermi  level  relative  to  the  band  edge 
results  in  little  difference  in  the  effective  tunneling  barrier 
heights  for  the  p-  and  n-type  regions.  As  a  result,  there  is 
’ittle  difference  in  the  observed  topographic  heights.  Inter¬ 
estingly  enough,  there  is  a  very  recent  STM  study  cf  Si  pn 
junctions  in  air'  with  the  oxide  layer  formed  on  the 
cleaved  Si  surface.  The  basic  idea  for  this  work  was  to  use 
sample/oxide  interface  to  replace  the  samplc/vacuum  in¬ 
terface,  thus  to  eliminate  the  surface  Fermi  level  pinning. 
Indeed,  it  was  found  that  there  exist  large  topographic 
height  differences  of  about  5  A  (similar  to  ours)  between 
the  p-type  and  n-type  regions  for  both  the  positive  and 
negative  tip  biases.  However,  it  should  be  noted  that  in  that 
case,  the  tunnel  junction  is  much  less  well-defined. 

In  summary,  we  have  used  scanning  tunneling  nicros- 
copy  and  spectroscopy  to  study  GaAs  multiple  pn  junc¬ 
tions.  Direct  to  topographic  contrast  over  the  pn  can  be 
observed  in  the  constant  current  image.  Tunneling  spec¬ 
troscopy  measurement  shows  consistency  with  the  assign¬ 
ment  of  the  p-  and  n-type  regions.  We  believe  that  the 
primary  mechanism  for  the  observed  contrast  is  due  to  the 
combination  of  the  tip  induced  band  bending  and  the  dif¬ 
ference  in  the  work  function  between  the  p-  and  n-lype 
regions.  Our  study  has  demonstrated  the  great  potential  of 
STM  for  providing  an  understanding  of  the  structural  and 
electronic  properties  of  semiconductor  junctions. 
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A  mechanism  for  achieving  a  first-order  phase  transition  in  the  threshold  characteristics  of  a 
laser  is  discussed,  and  data  demcmstrating  the  effect  are  presented.  It  is  suggested  that  a 
first-order  phase  transition  can  be  realized  through  the  design  of  a  laser  active  region  if  the  gain 
medium  is  localized  to  a  spatial  region  with  longitudinal  dimensions  much  less  than  the 
wavelength  of  the  resulting  laser  light.  Evidence  of  the  first-order  phase  transition  is  presented 
in  the  spontaneous  emission  characteristics  of  a  microcavity  semiconductor  laser,  which  shows 
a  decrease  in  the  separation  energy  of  the  quasi-Fermi  levels  at  threshold,  along  with  hysteresis 
in  the  light  versus  current  characteristics. 


Early  in  the  study  of  lasers  it  was  recognized  that  the 
transition  from  below  to  above  threshold  exhibits  charac¬ 
teristics  that  can  be  associated  with  a  nonequilibrium  phase 
transition.''^  In  most  lasers,  threshold  resembles  a  second- 
order  transition,  but  a  first-order  transition  can  also  be 
achieved  in  a  Fabry-Perot  cavity  laser  system  through  the 
use  of,  for  example,  a  saturable  absorber.*  In  this  letter  we 
suggest  that  a  more  fundamental  means  exists  in  which  a 
first-order  transition  in  laser  threshold  can  be  achieved, 
solely  through  the  passive  cavity  design  and  the  spatial 
distribution  of  the  gain  medium  in  the  cavity.  The  design  of 
such  laser  cavities  has  become  possible  with  the  advent  of 
semiconductor  microcavity  devices,  in  which  lasing  can  be 
achieved  using  a  gain  medium  which  exists  in  the  form  of 
a  heterostructure  quantum  well  (QW)  with  thickness 
much  less  than  its  emission  wavelength.  In  this  letter  we 
present  data  on  such  a  microcavity  laser  in  which  strong 
evidence  of  the  first-order  lasing  transition  has  been  found 
in  the  spontaneous  emission  characteristics  of  the  device’s 
threshold  region.  The  spontaneous  emission  characteristics 
provide  a  direct  means  of  analyzing  the  laser  gain  behavior 
at  threshold,  since  both  the  spontaneous  emission  and  op¬ 
tical  gain  depend  on  the  quasi-Fermi  level  separation  in  the 
QW  active  region  of  the  semiconductor  laser.  Saturation 
behavior  of  the  gain  at  threshold  is  measured  for  the  semi¬ 
conductor  microcavity  laser  and  compared  to  a  more  stan¬ 
dard  longer  cavity  edge-emitting  semiconductor  laser. 
While  the  first-order  lasing  transition  is  demonstrated  for 
the  semiconductor  laser,  we  believe  the  mechanisms  to  be 
quite  general  with  potential  application  to  other  laser  sys¬ 
tems  as  well. 

To  illustrate  the  role  of  the  cavity  and  gain  medium 
design  in  controlling  threshold  characteristics,  we  consider 
a  simple  model  in  which  a  Fabry-Perot  cavity  contains  a 
single  optically  thin  gain  region  with  a  thickness  lg<2.Q 
(gain  path  of  Z^),  where  Aq  is  the  center  emission  wave¬ 
length  of  the  active  medium.  Coherence  in  spontaneous 
emission  has  similar  descriptions  in  either  the  semiclassical 
or  fully  quantum  mechanical  approaches,’  and  it  is  illus¬ 
trative  in  this  discussion  to  consider  the  spontaneous  emis¬ 
sion  from  the  semiclassical  approach.  The  spontaneous 
emission  then  exists  in  the  cavity  in  the  form  of  wave  pack¬ 


ets,  the  coherence  length  of  each  wave  packet  being  deter¬ 
mined  by  some  characteristic  collision  time  which  fully 
breaks  coherence  in  the  spontaneous  emission.^  Since  the 
spontaneous  wave  packet  in  the  cavity  is  a  traveling  wave, 
the  optical  gain  felt  by  the  wave  packet  when  passing 
through  the  gain  region,  will  be  the  same  for  a  fixed 
number  of  excited  dipoles  whether  the  gain  is  confined  to  a 
region  of  lg<2.o>  distributed  over  a  region  of  length 
/f>Ao.  In  either  case  the  electric  field  amplitude  is  averaged 
over  a  cycle  of  oscillation  to  determine  the  photon-dipole 
interaction.  However,  above  threshold  the  gain  from  the 
thin  active  region  can  be  increased  if  the  optically  thin  gain 
region  is  placed  at  a  standing  wave  antinode  of  the  laser 
field  in  the  Fabry-Perot  cavity.  This  is  considerably  differ¬ 
ent  from  the  case  for  which  the  gain  region  is  distributed 
over  a  region  /^xAq  in  which  the  optical  gain  is  identical  for 
the  traveling  wave  packet  and  the  standing  wave  laser  field. 
The  change  comes  quite  simply  from  strategically  locating 
the  active  dipoles  at  a  standing  wave  antinode.  The  result 
of  the  change  is  now  an  optical  gain  which  depends  on  the 
light  field  in  the  cavity  (that  is  a  traveling  wave  due  to 
spontaneous  emission  or  a  standing  wave  due  to  the  laser 
field),  and  a  hysteresis  can  be  expected  to  develop  in  the 
threshold,  characteristic  of  a  first-order  phase  transition. 

Gain  in  the  semiconductor  laser  is  characterized  by  the 
separation  of  the  quasi-Fermi  levels  in  the  QWs,  which  will 
in  turn  be  controlled  by  the  drive  current  and  the  electron- 
hole  recombination  rate.  Figure  I  shows  a  qualitative  plot 
of  the  quasi-Fermi  level  separation  in  a  laser  active  region 
versus  drive  current  for  three  cases;  a  nonlasing  device 
with  no  gain  saturation,  gain  saturation  due  to  a  second- 
order  lasing  transition,  and  gain  saturation  due  to  a  first- 
order  lasing  transition.  When  the  gain  region  is  distributed 
over  a  distance  of  Ig  >  Aq,  the  saturation  in  the  separation 
energy  of  the  electron  and  hole  quasi-Fermi  levels  occurs 
smoothly  at  threshold  due  to  the  increase  of  stimulated 
emission  with  a  clamping  of  the  population  inversion,  rep¬ 
resentative  of  the  second-order  phase  transition.  This  be¬ 
havior  of  the  second-order  phase  transition  is  typical  of  the 
more  standard  edge-emitting  semiconductor  laser  with  a 
distributed  gain  region.  In  the  first-order  case  the  gain  re¬ 
quired  for  lasing  decreases  above  threshold  due  to  the  es- 
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FIG.  I.  Plot  of  the  quasi-Fermi  level  separation  in  a  p-n  junction  QW 
active  region  vs  drive  current  showing  the  behavior  for  a  noniasing  device, 
a  laser  with  a  second-order  transition  at  threshold,  and  a  laser  with  a 
first-order  transition  at  threshold. 

tablishment  of  the  standing  wave  lasing  held.  This  results 
in  a  decrease  of  the  quasi-Fermi  level  separation  with  in¬ 
creasing  drive  current  just  above  threshold  (Fig.  I ).  In  the 
analogy  between  phase  transitions  and  laser  threshold,  the 
population  inversion  for  laser  threshold  plays  the  role  of 
temperature  in  an  equilibrium  phase  transition,  while  the 
coherence  length  in  the  electric  field  is  the  ordering  param¬ 
eter.  The  population  inversion  in  the  semiconductor  laser  is 
given  by  the  quasi-Fermi  level  separation,  so  voltage  mea¬ 
sured  across  the  active  region  of  the  laser  plays  the  role  of 
temperature-  Consequently,  whether  the  phase  transition  is 
first  or  second  order  in  the  threshold  of  a  laser  diode  can  be 
determined  by  driving  the  diode  with  an  ideal  voltage 
source  and  measuring  the  threshold  characteristics  in  the 
light  output.  A  hysteresis  will  be  observable  for  this  ideal 
first-order  case,  due  to  the  negative  resistance  provided  by 
the  laser  diode  junction  impedance  at  threshold  (Fig.  1 }. 
We  note  that  a  hysteresis  in  the  operating  characteristics 
would  not  be  observed  if  the  laser  diode  were  driven  by  a 
current  source  even  for  the  first-order  transition  (from  the 
characteristics  of  Fig.  1 ),  but  instead  an  increased  slope 
efficiency  would  be  measured  just  above  threshold. 

Complete  details  of  the  device  structure  and  operating 
characteristics  can  be  found  in  Ref.  7,  and  we  give  only  a 
brief  description  of  the  devices  here.  The  structures  consist 
of  a  Fabry-Perot  microcavity  composed  of  an  optical 
wavelength  thick  Alo67Ciao33As  spacer  region  clad  on  the 
substrate  side  by  a  19.  S  pair  GaAs/AIAs  distributed  Bragg 
reflector  (DBR),  and  on  the  other  side  by  a  combination  of 
a  3  pair  GaAs/AlAs  DBR  and  4  pair  CaF^/ZnSe  DBR.  At 
the  center  of  the  Alo  «,7Gao  }}As  spacer  region  is  an  active 
region  consisting  of  two  Ing  2Gao  «As  60  A  thick  QWs  with 
60  A  GaAs  barriers  on  either  side.  The  combined  thickness 
of  the  QW  active  region  is  180  A,  compared  to  the  emis¬ 
sion  wavelength  from  the  QWs  of  A'~0.28  /xm  (inside  the 
semiconductor  material).  This  system  easily  satisfies  the 
requirement,  therefore,  of  a  gain  region  which  is  optically 
thin.  Hysteresis  develops  simultaneously  in  both  the  light 
versus  current  and  current  versus  voltage  characteristics  at 
lasing  threshold. 
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FIG.  2.  Light  vs  current  characteristics  for  the  mkrocavity  laser  demon¬ 
strating  hysteresis  in  the  threshold  characteristics  (dashed  curve)  mea¬ 
sured  normal  to  the  cavity,  aitd  also  demonstrating  a  sharp  decrease  in  the 
quasi-Fermi  level  separation  (solid  curve)  for  spontaneous  emission  mea¬ 
sured  out  the  edge  of  the  cavity. 

To  isolate  the  role  of  the  quasi-Fermi  levels  and  gain  at 
lasing  threshold,  we  have  made  a  relative  measurement  of 
the  spontaneous  emission  intensity  from  the  QW  active 
region  versus  current  from  below  to  above  threshold.  The 
spontaneous  emission  characteristics  provide  a  sensitive 
probe  as  to  the  relative  quasi-Fermi  level  behavior  in  the 
laser  active  region,  and  this  technique  has  been  used  by 
others  to  characterize  optical  gain  in  semiconductor  la¬ 
sers."  To  avoid  modulation  elTects  due  to  cavity  filtering, 
the  spontaneous  emission  is  detected  from  the  edge  of  the 
laser.  Figure  2  shows  the  light  versus  current  charactens- 
tics  measured  continuous-wave  (cw)  of  the  spontaneous 
emission  measured  from  the  device  edge  (solid  curve),  as 
well  as  from  the  laser  aperture  (dashed  curve).  The  spon¬ 
taneous  emission  peak  is  at  —0.98  jzm  which  is  also  the 
lasing  wavelength.  As  is  characteristic  of  all  these  micro¬ 
cavity  lasers,  a  hysteresis  exists  in  the  light  versus  current 
curve,  with  the  same  hysteresis  always  simultaneously 
served  in  the  current  versus  voltage  curve.  The  spontane¬ 
ous  edge  emission  of  Fig.  2  is  measured  through  a  spec¬ 
trometer  at  a  wavelength  of  0.96S  fim,  well  away  from  the 
lasing  wavelength,  to  avoid  effects  of  scattered  laser  emis¬ 
sion.  Laser  threshold  occurs  at  18-5  mA  with  the  charac¬ 
teristic  hysteresis  in  the  light  versus  current  curve.  In  the 
cleaving  of  the  device  to  obtain  a  measure  of  the  sponta¬ 
neous  edge  emission,  some  damage  occurred  giving  a  non- 
radiative  current  path  some  distance  from  the  10  pm  laser 
active  region.  This  current  path  increases  the  device 
threshold,  and  is  clearly  observed  in  the  spontaneous  edge 
emission  which  only  “turns  on"  above  9  mA.  An  unmis¬ 
takable  dip  is  measured  in  the  spontaneous  emission  at 
threshold  (solid  curve.  Fig.  2),  indicative  of  the  abrupt 
decrease  in  the  separation  of  the  quasi-Fermi  levels.  A  hys¬ 
teresis  can  be  seen  in  the  switching  of  the  spontaneous 
emission  light  intensity  which  occurs  concomitantly  with 
the  switching  in  the  laser  light  at  threshold.  This  switching 
behavior  in  the  spontaneous  emission  at  threshold  is  ob¬ 
served  up  to  the  shortest  measured  wavelength  9100  A. 
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FIG.  3.  Light  vs  current  characteristics  for  an  edge-emitting  channeled- 
substrate  laser  for  both  the  lasing  wavelength  ai  7856  A.  and  a  much 
shorter  wavelength  of  7700  A  which  shows  ideal  saturation  behavior 
indicative  of  a  second-order  lasing  transition. 

Above  threshold  the  quasi-Fermi  level  separation  increases 
slightly,  with  a  much  smaller  slope  than  that  observed  be¬ 
low  threshold.  We  find  the  same  threshold  behavior  in  the 
spontaneous  emission  for  other  laser  microcavuy  devices 
The  saturation  of  the  quasi-Fermi  level  separation  above 
threshold,  which  shows  a  slight  increase  with  increasing 
drive  current  in  Fig.  2,  will  be  determined  by  intraband 
scattering  rates  relative  to  stimulated  emission  rale  and 
pump  rate.  We  note  that  a  consistent  explanation  can  be 
provided  only  by  considering  the  role  of  the  lasing  mode  on 
threshold  characteristics  for  the  following  reasons.  The  p- 
side  mirror  of  the  laser  is  deposited  using  electron-beam 
evaporation  after  crystal  growth.  Therefore  we  are  able  to 
observe  an  individual  device's  current-voltage  characteris¬ 
tics  before  deposition  of  the  mirror,  and  before  lasing  ac¬ 
tion  is  observed.  The  lasing  photon  energy  lies  below  the 
absorption  gap  of  other  layers  in  the  semiconductor  het¬ 
erostructure,  which  consist  of  GaAs  and  AlAs.  Switching 
behavior  in  the  electrical  and  lasing  characteristics  is  only 
observed  at  threshold  after  the  ;7-5ide  mirror  reflectivity  is 
increased  sufficiently  to  achieve  lasing. 

A  major  drawback  to  microcavity  scmiconducloi  la¬ 
sers  is  the  high  electrical  series  resistance  measured  in  ev¬ 
ery  device  reported  to  date.  This  series  resistance,  which  is 
several  hundreds  of  ohms  at  threshold,  will  cancel  the  neg¬ 
ative  resistance  due  to  Junction  impedance  for  the  first- 
order  lasing  transition.  Because  of  the  biasing  scheme  used 
in  the  devices  reported  here,  the  negative  resistance  of  the 
laser  active  region  is  coupled  to  an  additional  positive  feed¬ 
back  current  path  which  allows  the  direct  observation  of 
the  bistability  in  our  current- voltage  and  light-current 
characteristics  at  threhold.  The  additional  feedback  comes 
from  a  leaky  n-type  blocking  layer  which  funnels  most  of 


the  bias  current  into  a  10  ^m  diam  active  region,  but  which 
also  allows  some  leakage  current  and  carrier  buildup  in  the 
QW  region  surrounding  the  ac :ual  laser  cavity.  At  thresh¬ 
old  the  collapsing  quasi-Fermi  levels  pull  excess  charge 
into  the  10  pm  laser  active  region  from  the  surrounding 
QW  region.  In  this  case  the  negative  resistance  intrinsic  to 
the  laser  active  region  is  "amplified”  by  this  additional 
leakage  current. 

The  switching  behavior  observed  in  the  spontaneous 
emission  characteristics  of  the  microcaviiy  laser  can  be 
compared  to  the  more  typical  behavior  expected  for  edge 
emitting  lasers  with  distributed  gain  regions.  For  compar¬ 
ison,  the  spontaneous  emission  characteristics  measured 
out  the  edge  of  an  AlGaAs-GaAs  channeled  substrate  laser 
are  shown  in  Fig.  3.  The  continuous- wave  edge  emission  is 
shown  for  measurements  both  at  the  lasing  wavelength,  in 
which  threshold  is  observed  at  ~50  mA.  and  at  a  some¬ 
what  shorter  wavelength  of  0.77  pm.  The  spontaneous 
emission  measured  at  0.77  pm  shows  the  ideal  behavior  of 
gain  saturation  expected  for  a  second-order  phase  transi¬ 
tion,  with  no  change  in  spontaneous  emission  intcnsny 
above  threshold. 

In  summary,  we  have  presented  a  mechanism  for 
achieving  a  first-order  phase  transition  m  the  threshold 
characteristics  of  a  laser.  The  mechanism  is  based  on  the 
gain  dependence  on  the  lasing  mode  of  light  in  the  cavity 
when  the  gain  region  is  placed  at  a  standing  wave  antinode 
The  semiconductor  microcaviiy  laser  provides  a  suitable 
system  for  building  such  lasers,  since  high  gam  can  be 
achieved  from  an  optically  thin  active  region  That  such 
characteristics  might  be  tied  to  an  intrinsic  bistability  in 
the  cavity  and  gain  region  makes  the  devices  particularly 
attractive  for  large  optical  systems  due  to  the  potential  for 
high  speed  optical  switching  However,  the  mechanisms  as 
discussed  in  this  letter  are  quite  general,  and  may  serve  to 
impact  the  design  of  other  laser  systems  in  which  a  first- 
order  lasing  transition  is  desired 
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Abstract— Dntik  arc  preaeated  dcmoBstratiBg  optical  switcUaf 
and  memory  ta  a  bistable  vertkal-cavlty  surfacc-emltUag  laser. 
Optical  swiicbiag  from  lasing  to  aoal^ng  or  from  oonlasing 
to  tasiag  in  the  vertical-cavity  las«r  is  demonstrated  nsing  an 
AIGaAs  probe  laser  at  -  0.78  pm. 


WE  have  recently  demonstrated  a  vertical-cavity  sur¬ 
face-emitting  laser  (VCSEL)  that  exhibits  concomitant 
bistable  behavior  in  the  light  versus  current  and  current 
versus  voltage  characteristics  (1).  Such  device  operation  is  of 
considerable  interest  for  applications  of  optical  switching  and 
optical  memory.  It  is  especially  desirable  to  realize  this  type 
of  operation  in  a  VCSEL.  given  the  natural  extension  to  large 
area  two-dimensional  arrays,  and  the  potential  ease  with 
which  such  a  device  geometry  may  be  optically  probed. 
There  have  been  recent  reports  on  optical  switching  in 
VeSEL’s  that  incorporate  pn-pn-layered  structures  |2!  or 
optically  addressable  phototransistors  (3).  The  device  struc¬ 
ture  presented  in  our  work  diflFers  from  those  in  terms  of  the 
coupling  which  exists  between  the  lasing  light  of  the  VCSEL 
active  region  and  the  electrical  switching  in  the  surrounding 
pn-pn-Iayered  structure  [!}.  Bistability  in  this  device  .struc¬ 
ture  has  been  described  in  some  detail  [11.  however,  initial 
attempts  to  optically  switch  the  device  with  an  external  probe 
were  unsuccessful.  Here  we  present  the  first  demonstration  of 
optically  switching  with  an  external  probe,  and  demonstrate 
optical  memory  and  an  optical  inverter. 

The  design  and  fabrication  of  the  VCSEL  studied  in  this 
work  have  been  discussed  previously  [11, 14),  therefore,  only 
a  brief  description  of  the  VCSEL  structure  is  given  here.  The 
epitaxial  layers  of  the  device,  illustrated  in  Fig.  1 ,  are  grown 
in  two  different  growth  sequences  using  molecular  beam 
epitaxy  (MBE).  The  layers  deposited  in  the  initial  growth  are 
composed  of  a  1  ftm  n*  GaAs  buffer,  followed  by  a  21.5 
pair  n-type  quarter-wave  stack  of  GaAs/ A1  As,  a  lower  n-type 
Alo,^7Gao  33AS  confining  layer,  an  undoped  active  region 
consisting  of  two  60  A  InojCa^gAs  pseudomorphic  quan¬ 
tum-wells  (QW’s)  with  three  60  A  GaAs  barriers,  an  upper 
p-type  Alo^,Gao33As  confining  layer,  one  p-type  quarter- 
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Fig.  I.  Schematic  cross-section  (not  to  scale)  of  the  VCSEL  structure  used 
m  th»  work.  Two  modes  of  optical  switching  are  pouible  by  optically 
addressing  either  the  10  am-diameter  VCSEL  active  region  (foltower  mode), 
or  off  of  the  VCSEL  active  region  {invetter  mode) 

wave  pair  of  AlA.s/GaAs.  a  200  A  p-lype  Aloj7Gao3,As 
etch  stop  layer,  and  finally  a  600  A  n-type  GaA-s  current 
blocking  layer.  The  p-  and  n-type  Aloft,Gao  ,,As  confining 
layers  and  QW  active  region  have  a  total  combined  thickness 
of  an  optical  wavelength.  The  p-type  AlAs/GaAs  pair  is 
grown  with  a  graded  superiatticc  between  the  layers  in  order 
to  minimize  series  resistance.  After  selective  etching  steps  to 
define  the  10  pm-diametcr  VCSEL  active  region,  a  second 
MBE  growth  of  two  more  p-type  AlAs/GaAs  quarter-wave 
pairs  is  performed,  again  with  superlattice  grading  layers 
inserted  at  each  Al,  .s/GaAs  interface.  To  form  the  dielectric 
stack  deposited  on  the  p-type  semiconductor  layers  of  Fig.  I, 
we  have  used  either  ZnSe/CaFj  or  Si/SiOj  quarter-wave 
layers.  The  ZnSe/CaF^  combination  has  been  found  to  yield 
significantly  improved  device  performance  [5). 

Light  versus  current  characteristics  of  the  VCSEL  in  the 
operating  region  around  threshold  are  shown  in  Fig.  2. 
Lasing  threshold  is  -  3.4  mA,  and  a  hysteresis  window  of 
~250  /I  A  in  current  exists  in  the  threshold  region  with  a 
window  of  ~  200  in  light  output  power.  The  lasing 
wavelength  is  -0.98  ^m.  A  hysteresis  also  exists  in  the 
current  versus  voltage  characteristics  (not  shown),  providing 
for  -0.5  V  of  hysteresis  from  the  "on”  (lasing)  switch 
point  to  the  "ofT’  (nonlasing)  switch  point  (IJ.  As  we  show 
below,  the  hysteresis  in  the  current  versu.s  voltage  character¬ 
istics  due  to  the  lasing  bistability  makes  it  possible  to  bias  die 
VCSEL  in  the  voltage  hysteresis  window  (or  current  hystere¬ 
sis  window  shown  in  Fig.  2)  and  optically  switch  the  device 
from  either  a  lasing  state  to  a  nonlasing  state  (invetter  mode), 
or  from  a  noniasing  to  a  lasing  state  (follower  mode).  The 
selection  of  either  the  follower  or  inverter  mode  is  made  by 
positioning  the  AlGaAs  laser  probe  beam  either  on  the 


t04MI35/9l$0t.00  ©  1991  IEEE 


HUFFAKER  tf «/.:  OPTICAL  MEMORY 


Cun*nt(mA) 


Fig.  2.  Us^  versus  cumnt  characteiisiks  measured  under  coMimious- 
wave  room-temperantre  operation  of  the  VCSEL  at  threshold  showing  the 
hysteresis.  The  hysteresis  window  is  -  250  s>A  for  current  and  ~  200  r>W 
for  the  light  output. 

VCSEL  active  region  for  the  follower  mode,  or  somewhat  off 
of  the  active  region  for  the  inverter  mode,  as  shown  in  Fig. 
1.  The  ftrilowcr  mode  has  been  achieved  while  driving  the 
VCSEL  with  either  a  current  source  or  a  voltage  source, 
whereas  the  inverter  mode  has  been  achieved  with  only 
current  source  biasing. 

Fig.  3  shows  the  optical  switching  characteristics  of  the 
VCSEL  (grating  in  the  inverter  mode,  with  the  laser  biased 
just  above  the  hysteresis  window.  For  this  biasing  scheme  the 
VCSEL  simply  switches  from  above  threshold  (at  -  0.2  mW 
output  power)  when  the  probe  laser  is  low  to  below  threshold 
(~0.01  mW)  when  the  probe  laser  is  high,  without  exhibit¬ 
ing  memory. 

Fig.  4  demonstrates  the  function  of  optical  memory  using 
the  VCSEL  in  the  optical  follower  mode.  For  the  device 
measured  for  Fig.  4  as  compared  to  the  device  measured  for 
Fig.  3,  a  reduced  contact  dimension  in  which  the  ctmtact 
consists  of  a  metallized  ring  of  60  urn  outer  diameter  and  30 
urn  inner  diameter  has  been  used,  which  greatly  improves  the 
device  response  time.  The  data  of  Fig.  4  demonstrate  optical 
memory  using  3  its  voltage  pulses  (lower  curve.  Fig.  4)  to 
bias  the  VCSEL  below  threshold  but  in  the  hysteresis  win- 
(tow  of  the  voltage  current  curve.  During  the  3  ^s  voltage 
pulse  to  the  VCSEL,  a  200  ns  voltage  pulse  is  applied  to  the 
AlGaAs  probe  laser  (upper  curve.  Fig.  4),  the  optical  pulse 
of  which  (at  a  wavelength  of  0,78  /rm)  triggers  the  VCSEL 
into  the  “on”  (lasing)  state.  The  internal  optical  latching  of 
the  VCSEL  holds  its  output  above  threshold  after  the  200  ns 
pulse  from  the  AlGaAs  probe  laser  goes  low.  The  middle 
curve  of  Fig.  4  shows  the  light  output  of  the  VCSEL  mea¬ 
sured  using  a  Ge  photodiode.  The  VCSEL  turns  on  when 
triggered  by  the  AlGaAs  probe  laser,  and  stays  on  until  the 
end  of  the  voltage  pulse  to  the  VCSEL.  Also  observable  in 
the  lower  curve  of  Fig.  4  is  the  increase  in  current  which 
occurs  for  the  VCSEL  when  it  is  switched  to  the  "on”  state. 
The  increase  in  current  results  from  the  optical  switching  of 
the  pn-pn  regions  of  Fig.  I  by  the  VCSEL  laser  light,  which 
occurs  only  in  the  immediate  vicinity  of  the  VCSEL  10  ^m 
active  region.  The  impedance  change  is  only  observable  in 
these  devices  at  lasing  threshold.  Using  the  identical  biasing 
scheme  of  Fig.  4,  but  with  the  optical  path  between  the 
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Fig.  3.  iMcntiiy  versus  time  for  the  optical  protie  signal  at  a  waveki^  of 
0.78  iitB  <upper  trace)  and  VCSEL  output  at  a  wavelength  of  0.98  iim 
(lower  trace)  for  the  VCSEL  operated  in  the  inverter  mode  (as  Uhistnied  in 
Fig.  I). 
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Fig.  4.  Optical  memory  tlemonsirated  using  the  VCSEL  in  the  follower 
mode  The  upper  trace  shows  the  current  pulse  applied  to  the  AlGaAs  probe 
laser  which  switches  the  VCSEL  into  the  ‘  on"  (la.sing)  slate.  The  lower 
trace  shows  the  VC:SEL  bias  current,  and  the  middle  trace  shows  the  VC;SEL 
light  output.  An  impedance  change  in  the  VCSEL  can  be  wen  in  the  lower 
curve  at  the  lime  of  the  probe  pulse,  due  to  the  switch  from  nonlasing  to 
Using. 

AlGaAs  probe  laser  and  VCSEL  blocked,  no  switching  of  the 
VCSEL  occurs,  as  indicated  by  the  Ge-phouxliodc  re^xmse 
and  the  lack  of  an  impedance  change  in  the  VCSEL. 

Optical  memory  has  been  achieved  with  AlGaAs  probe 
pulses  as  short  as  SO  ns,  which  is  the  limit  of  the  vdtage 
pulser  used  to  drive  the  AlGaAs  probe,  at  the  minimum 
probe  power  required  for  switching.  Delay  time  between  the 
leading  edge  of  the  qitical  pulse  frt»n  the  AlGaAs  probe  and 
optical  switching  of  the  VCSEL  to  the  lasing  state  has  been 
measured  to  be  less  than  10  ns,  also  at  the  minimum  optical 
probe  power  required  for  switching.  A  delay  tinK  of  ~  100 
ns  exists  for  the  leading  edge  of  the  bias  voltage  pulse  to  the 
VCSEL  before  which  the  VCSEL  cannot  be  optically 
switched.  It  is  assumed  that  this  delay  time  arises  due  to. 
parasitic  capaciunce  of  the  pn-pn  region  peripheral  to  the 
VCSEL  active  region  which  does  not  contribute  to  the  (tevice 
bistability,  but  which  must  be  charged  at  the  beginning  of  the 
bias  pulse.  After  the  initial  charging  (on  the  leading  edge  of 
the  VCSEL  bias  pulse)  this  parasitic  capacitance  does  not 
limit  VCSEL  switching  speed. 

In  summary,  we  have  presented  data  which  demcHistrate 
optical  switching  and  optical  memoiy  in  a  bisuble  VCSEL. 
These  functions  are  thought  to  be  important  for  such  future 
applications  as  c^tical  computing.  The  realization  of  these 
Ametions  in  a  VCSEL  is  particularly  attractive  given  the 
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natural  extension  of  this  device  geometrs  to  large  area  two- 
dimensional  arrays. 
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iransislor:  t> pc  A  with  a  liigli  ciiiiitcr  rcsi.slaticc  and  t>pc  H 
with  a  low  emitter  resistance.  The  common  emitter  current 
gain  IS  100  for  both  types  of  devices.  The  cause  for  the  high 
emitter  resistance  for  the  type  A  device  is  a  high  contact  resist¬ 
ance  between  the  polysiiicon  and  aluminium,  which  appeared 
from  Kelvin  structures  on  the  same  wafer.  The  type  B  devices 
were  identically  processed  as  the  type  A  devices  except  for  a 
tungsten-silictde  layer  between  the  aluminium  and  polysiiicon. 
This  greatly  improves  the  contact  resistance  without  affecting 
other  transistor  parameters. 


rig.  4  Emiiti‘r  resistance  as  function  of  forward  bias  for  type  A  tran¬ 
sistor  with  emitter  area  of  Hi)  *  S 

The  measured  emitter  resistance  for  a  type  A  device  with  an 
emitter  area  of  80  x  is  plotted  in  Fig.  4.  It  illustrates 

that  the  calculated  Jt,  is  much  too  high  for  low  due  to 
numerical  errors,  whereas  the  value  at  high  y^,  tends  to  be  too 
low  due  to  high  injection  effects,  fn  the  region  in  between,  a 
rather  constant  value  for  the  emitter  resistance  is  found.  Table 
1  summarises  the  results  obtained  from  the  three  different 
methods.  However,  the  open  collector  method  as  well  as  the 
method  of  Ning  and  Tang  led  to  ambiguous  results.  There¬ 
fore.  the  results  had  to  be  matched  to  the  ones  measured  with 
the  new  method,  which  gave  more  reproducible  results. 


Table  1  MEASURED  EMITTER  RESISTANCES 


R, 

R. 

R, 

New 

Ning  and 

Open 

Device 

method 

Tang 

collector 

Q 

n 

n 

Type  A 

/4j  =  5  X  5/jm^ 

200 

7 

160 

Type  A 

Af  *=  8()  X  5 

40 

36 

40 

Type  B 

Af  =  5  X  5/im^ 

22 

15 

15 

Type  B 

Aj  =  80  X  5  /jm^ 

6 

4 

5 

Cimciusiiiits  ■  Wc  have  proposed  a  new  method  to  determine 
the  emitter  resistance  directly  from  the  Gummel  plot.  This 
simple  method  is  based  on  impact  ionisation  in  the  collector- 
base  depletion  region,  which  balances  out  the  external  base 
current. 

20lh  February  1992 

R.  C.  M.  Wijburg  and  J.  H.  Klootwijic  (MESA  Resein-ch  Institute. 
University  of  Twente,  PO  Box  217,  7500  AE  Enschede,  The 
Netherlands) 
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CASCADABILITY  OF  OPTICALLY  LATCHING 
VERTICAL-CAVITY  SURFACE-EMITTING 
LASER 

D.  L.  HufTaker,  D.  G.  Deppe,  C.  Lei,  T.  J.  Rogers, 
B.  G.  Streetman,  S.  C.  Smith  and  R.  D.  Burnham 


Indexing  terms:  Lasers,  Semiconductor  lasers 

The  cascadability  of  an  optically  latching  vertical-cavtty 
surface-emitting  laser  is  investigated  through  the  use  of  a 
wavelength-tunable  optical  probe.  The  bislable  vertical 
cavity  laser  emits  at  a  wavelength  of  0-98  jon,  and  shows  a 
sensitivity  peak  to  probe  light  generated  from  a  grating- 
tunable  edge-emitting  laser  in  the  0  98  pm  wavelength  range. 
The  sensitivity  peak  occurs  in  a  window  of  ~50A  around 
the  vertical-cavity  lasing  wavelength,  and  corresponds  to  the 
Kahry-Perot  lransmi.s.cion  window  of  the  surface-emitter 


Optical  latching  is  an  important  function  for  the  realisation  of 
optical  memory.  Several  reports  of  various  schemes  to  achieve 
optical  latching  in  vertical-cavity  surface-emitting  lasers 
(VeSELs)  have  been  recently  reported  [1-4],  The  realisation 
of  optical  memory  in  the  VCSEL  is  particularly  attractive 
because  the  device  geometry  is  suitable  for  2-dimensional 
arrays,  and  -allows  convenient  optical  addressing.  Demonstra¬ 
tions  of  wavelength  tuning  across  a  2-dimensional  array  may 
also  be  of  use  in  the  parallel  transmission  of  optical  data  [S]. 
In  the  future  it  may  be  feasible  to  cascade  optically  latching 
VeSELs  of  similar  wavelength.  This  operation  may  be  con¬ 
trasted  with  previously  demonstrated  optical  switching  and 
memory  using  a  VCSEL  probed  at  a  wavelength  outside  of 
the  Fabry-Perot  reflectivity  curve  [2].  Ft  has  recently  been 
emphasised  that  monolithic  integration  of  the  VCSEL  with  a 
separate  pnpn  switch  provides  cascadability  due  to  the  wide 
wavelength  range  over  which  the  pnpn  can  be  switched.  This 
may  prove  to  be  an  inherent  advantage  over  structures  in 
which  the  pnpn  region  is  incorporated  directly  into  the  device 
active  region  [3,  4].  However,  the  integration  of  the  pnjm 
switching  layers  directly  into  the  VCSEL  active  region  offers 
advantages  due  to  the  ease  of  device  fabrication,  potential  for 
improved  device  speed,  and  denser  device  packing.  In  addi¬ 
tion,  wavelength  sensitivity  in  the  VCSEL  switching  may  also 
be  a  desirable  characteristic  in  some  applications.  These  con¬ 
siderations  have  led  us  to  investigate  the  switching  character¬ 
istics  of  a  bistable  VCSEL  using  an  optical  probe  beam  with  a 
wavelength  near  the  lasing  wavelength  of  the  VCSEL.  Optical 
switching  is  demonstrated  at  wavelengths  inside  the  stopband 
of  the  Fabry-Perot  cavity,  with  an  increased  sensitivity  near 
the  Fabry-Perot  resonance. 

Details  of  the  device  structure  and  fabrication  have  been 
described  previously  [8].  The  VCSEL  emits  at  a  wavelength 
of  0-98  ^m  and  demonstrates  hysteresis  in  both  light  against 
current  and  current  against  voltage  characteristics.  Tite  hys¬ 
teresis  in  the  current  against  voltage  characteristics  makes  it 
possible  to  bias  the  VCSEL  in  the  hysteresis  window  and 
optically  switch  the  device  from  a  nonlasing  state  to  a  lasing 
state  [2].  A  schematic  diagram  of  the  optical  setup  used  to 
investigate  the  VCSEL  switching  sensitivity  to  the  probe 
wavelength  is  shown  in  Fig,  I,  The  optical  probe  consists  of 
an  edge-emitting  laser  with  a  single  100 A  Ino  iGaonAs 
quantum  well  active  region,  with  one  of  the  facets  coated  with 


an  antirelleciing  film.  The  probe  wavelength  is  selected 
through  the  use  of  an  external  grating  [7]  The  laser  is  tunable 
from  0  96  to  1  Opm.  Tuning  of  the  probe  laser  with  the  exter¬ 
nal  grating  occurs  in  mode  hops  of  -~2  0A,  which  corre¬ 
sponds  to  weak  reflections  from  the  edge-emitting 
Fabry-Perot  cavity.  Light  emitted  from  the  rear  facet  of  the 
probe  laser  is  focused  onto  the  lO^im  active  region  of  the 
VeSEL,  The  VCSEL  is  mounted  on  a  thermoelectric  cooler 
(TEC)  which  maintains  an  operating  temperature  of  ~285K. 
The  VCSEL  and  probe  output  signals  are  each  collected  by 
.separate  Ge  photodiodes. 

VCSEL 


Fig.  I  Schemaiic  Jiai/ram  o/  experimrnlul  xelup  uffd  (n  drmonslrate 
iii'nadtihfttiy  nf  ti.imu  watt'lfnqth-iurn-ahle  cdue-cmil/mi/ 

prnbe  Ici.ier 

Wavelength  is  selecicd  by  rotating  external  diffraction  grating  to 

p'  ivide  positive  feedback  to  active  region  probe  laser 

Switching  of  the  bistable  VCSEL  with  0-98 pm  wave¬ 
length  tight  is  shown  in  Fig.  2.  where  four  traces  repre.sent 
electrical  bias  pulses  to  both  the  VCSEL  and  the  probe,  and 
the  corresponding  light  output  from  each  device.  The  upper 
trace  shows  the  5ps  pulse  to  the  VCSEL.  which  biases  the 
device  just  below  lasing  threshold  in  the  hysteresis  window 
[8].  During  the  5ps  pul.se  to  the  VCSEL,  a  0-25 ps  bias  pulse 
is  applied  to  the  probe  laser  which  drives  the  probe  laser 
above  threshold,  as  indicated  by  the  two  traces  labelled  probe 
current  and  probe  output.  The  light  pulse  produced  by  the 
probe  laser  at  a  wavelength  0  98  pm  vs  seen  to  switch  the 
VCSEL  inio  its  lasing  stale,  where  it  remains  for  the  duration 
of  the  electrical  biasing  pulse  to  the  VCSEL  as  shown  by  the 
trace  labelled  VCSEL  output. 

There  is  significant  sensitivity  of  the  VCSEL  switching 
response  to  (he  probe  laser  wavclcnglh  in  (his  wavelength 
range,  due  to  ihc  transmission  charaeicnslics  of  the  VCSEL 
Fahrs-Pcrol  cavity.  Fig.  ,7  shows  the  relative  probe  intensity 
..gainst  wavelenglh  needed  In  switeli  llie  VCSEL.  The 
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VCSEL  emission  spectrum  above  threshold  is  also  shown,  to 
specify  its  spectral  location  with  respect  to  the  sensitivity 
peak.  The  data  indicate  that  the  VCSEL  is  most  sensitive  to 
the  probe  beam  near  its  lasing  wavelength,  where  the  probe 
intensity  corresponds  to  ~3  5mW  The  probe  power  required 
for  switching,  therefore,  is  significant  compared  to  the  power 
emitted  by  the  VCSEL  just  above  threshold  which  is 
-~0-2mW  [2].  Flowcver,  we  have  observed  selfswitching  of  the 
VCSEL  in  which  the  light  emitted  from  the  device  below 
threshold  is  eoupted  back  Into  the  VCSEL  by  msanii  sf  an 
external  reflector,  and  triggers  the  switching  process.  If  this 
external  reflector  is  considered  as  an  external  probe  source, 
the  light  power  required  for  switching  is  extremely  small.  It  is 
not  clear  at  this  time  what  the  exact  mechanism  of  this  self¬ 
switching  is,  and  whether  the  switching  is  either  extremely 
wavelength  sensitive  beyond  our  tuning  resolution  of  wave¬ 
length  with  the  external  probe  source,  or  possibly  phase  sensi¬ 
tive. 


Fig.  i  Relalite  probe  i/iirnsiii  {dau>  pmniu  wrsMr>  to  optically 
ywilch  VeStL.  ay  function  of  Hjivlcnijih 

VCSEL  lasmg  specirum  is  suptiimpostd  on  same  wavelength  scale 

for  comparison 

To  investigate  the  mechanism  of  optical  switching  when  (he 
external  probe  laser  is  used,  we  have  studied  the  sensitivity  of 
the  current-voltage  characteristics  to  wavelength  of  the  i.so- 
laled  pnpn  structure  which  is  integrated  with  the  VCSEL  [8]. 
For  this  case  a  region  of  the  pnpn  structure  is  isolated  from 
the  VCSEL  active  region  by  etching  into  the  epitaxial  layers 
creating  a  deep  current  blocking  trough.  Although  the 
current-voltage  characteristics  of  the  isolated  pnpn  layers  are 
quite  sensitive  to  probe  wavelengths  of  -()79^m,  we  have 
found  that  they  arc  insensitive  to  0  98  pm  light  under  the 
forward  biasing  condition  used  in  the  VCSEL  switching 
experiment  This  suggests  that  the  switching  due  to  the 
0  98pm  probe  light  shown  in  Fig.  2  is  likely  triggered  by 
absorption  of  the  probe  light  in  the  InGaAs  quantum  well 
active  region  sufficient  to  drive  the  VCSEL  above  threshold. 

In  summary,  wc  have  presented  data  which  demonstrates 
the  cascadabilily  of  a  optically  latching  VCSEL  at  its  lasing 
wavelength,  and  the  sensitivity  of  the  switching  to  probe  light 
at  other  wavelengths.  These  functions  should  be  useful  m 
future  applications  such  as  optical  computing  and  memory 
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BIT  DECISION  TABLE  FOR  HYPERCUBE 
DECOMPOSITION 

C.  H.  Chang,  H.  K.  Azzam  and  H.  H.  Dediu 

Indexing  terms:  Parallel  processing.  Computers 

A  new  algorithm  for  subcube  assignment  and  decomposition 
is  introduced  to  solve  the  fragmentation  problem  in  hyper¬ 
cube  (n-cubc)  processor  allocation.  The  sharing  density 
vector  introduced  in  the  prime  cube  graph  strategy  [1]  is 
used  as  a  vehicle  to  develop  the  new  algorithm.  A  bit  deci¬ 
sion  table  is  devised  to  provide  the  ke '  parameters  for  deter¬ 
mining  the  decomposition.  It  is  proven  that  the  result  of  the 
algorithm  is  at  least  22%  better  than  the  sharing  density 
vector  approach. 


Terms  and  definitions:  Let  a  be  a  prime  cube  of  dimension  k  in 
a  given  configuration  Cj.  We  denote  the  positions  of  bits  in  a 
that  correspond  to  *s  by  s,,  ...,  s^.  Assume  also  that  a  is 
connected  to  each  clement  of  the  set  C,  =  ...,  fii},  where 

C.  is  a  subset  of  Cf,  and  fliS  arc  prime  cubes,  fhe  zero  index, 
one  index  and  star  index  of  a  at  bit  .v,,  denoted  by  Zt'RG.li). 
ONE,{i)  and  *,(0,  are  integers  equal  to  the  numbers  of  prime 
cubes  in  C,  that  contain  0,  1  and  *,  respectively,  in  bit  s, . 

The  bit  decision  table  provides  a  simplified  view  of  zero  and 
one  patterns  in  a  binary  subcube  representation.  The  first  row 
consists  of  the  bits  of  the  target  prime  cube.  For  each  prime 
cube  adjacent  to  the  target,  a  row  consisting  of  the  bits  of  the 
prime  cube  is  added  to  the  table.  What  follows  is  the  penulti¬ 
mate  row  (/)  consisting  of  two  indication  numbers  and  one 
indication  symbol  per  column  providing  a  summary  of  the 
zero,  one,  and  star  statistics  in  that  column.  The  first  indica¬ 
tion  number  is  min  (Z£RO,(/),  0/VjE,(ij).  Following  the  indica¬ 
tion  number  is  a  symbol  denoted  the  split,  which  is  equal  to  0 
if  ZEROJi)  <  ONEjii),  1  if  ZERO,(i)  >  ONE,{i),  or  •  other¬ 
wise.  The  last  indication  number  is  a  count  of  the  stars  in  the 
corresponding  column,  ‘.(i).  Finally,  the  last  row  (5)  contains 
the  sum  of  the  first  and  last  indication  numbers  of  each 


Example  /  Lcl  f  j.  =  Jx,  /I,  yj.  where  i  -  ***(K)*.  //  =  OOIO”, 
•/  =  00**01,  and  a  task  requiring  a  subeube  of  dimension  3 
arrives.  The  following  bit  decision  table  is  generated  ; 

a  •  •  *  0  0*  (Target) 

fi  0  0  1  0  •  * 

y  0  0  •  *01 

/  010  010  001  001 
S  0  0  1  I 

For  example,  in  the  first  column,  the  first  indication  number  is 
0.  followed  by  the  indication  symbol  I  and  the  indication 
number  0,  which  means  that  in  the  last  row  the  select  value  is 
0  -f  0  =  0.  The  select  value  is  a  better  indicator  in  selecting  a 
subcube  from  a  prime  cube  than  the  sharing  density  vector. 

Star  search  algorithm:  An  algorithm  (we  choose  to  call  star 
search)  for  determining  the  subcube  that  will  be  assigned  to 
the  ask  is  discussed  in  this  Section.  The  algorithm  proceeds  as 
follows; 

(1)  determine  the  zero  index  (ZEROj^i)),  the  one  index 
(ONEjti)),  and  the  star  index  of  each  bit  of  the  target 
prime  cube  a  that  contains  a  star 

(2)  find  the  select  values 

(3)  choose  the  bits  with  the  least  select  values  as  targets  for 
splitting  and  find  the  split  at  that  position 

(4)  substitute  each  bit  selected  in  step  3  with  its  corresponding 
split  (if  *,  choose  randomly);  assign  the  resulting  sutKube  to 
the  task  (this  is  one  of  2'  subcubes,  where  r  —  dimension  of  the 
target  prime  cube-dimension  of  the  task). 

Consider  the  configuration  (a,  p,  y).  where  a  =  00* *00, 
^  =  0**00*,  V  «=  'O*!*©.  For  a  task  requiring  a  subcube  of 
dimension  1,  the  following  bii  i.cision  table  is  formed: 

a  0  0  *  *  0  0  (Target) 

fi  0  •  *  0  0  • 

y  •  0  •  I  *  0 

/  0*2  1»0 

S  2  1 

The  smaller  of  the  two  select  numbers  is  in  column  4.  Hence 
bit  4  should  be  selected  as  target  for  decomposition.  Now  wc 
have  to  choose  from  one  of  the  two  following  subcubes; 
(00*000)  and  (00*100).  The  second  indication  bit  in  the  selec¬ 
ted  column  is  *,  which  means  either  of  these  two  subcubes  can 
be  assigned  to  the  incoming  task. 

The  three  indication  numbers  are  the  key  to  deciding  which 
bii  should  be  decomposed  and  which  subcube  is  allocated  to 
the  task.  The  select  value  is  the  number  of  prime  cubes  lost  in 
case  the  decomposition  is  carried  out  at  the  corresponding  bit 
in  the  target  and  the  task  is  assigned  to  the  subcube  with  the 
second  indication  number  (the  split).  Therefore,  minimising 
this  select  value  assures  the  minimum  loss  of  prime  cubes. 

Performance  enoluolion:  Wc  prove  in  this  Section  that  the 
result  of  our  algorithm  is  22%  or  more  better  than  the  sharing 
density  vector  approach  [1].  Consider  the  ith  and  jth  bits  of 
the  target  prime  cube  «.  For  example,  let  us  assume  that  the 
number  of  prime  cubes  aojacent  to  at  is  10.  Also  assume  that 
the  total  number  of  ones  at  bit  i  in  all  the  adjacent  prime 
cubes  is  4,  the  number  of  zeros  is  2,  and  the  number  of  stars  is 
4.  At  the  same  time  assume  that  the  total  number  of  ones  at 
bit  j  in  all  the  adjacent  prime  cubes  is  5  and  the  number  of 
zeros  is  5.  According  to  the  oD  vector  method  we  s'  ould 
choose  I  although  a  better  decision  to  choose  J  is  detected  by 
the  bit  decision  table. 

When  using  the  bit  decision  table  as  opposed  to  the  SD 
vector  method  an  improvement  occurs  with  the  probability : 

F[K]  =  (probability  of  choosing  i  when  using  the  SD  vector 
method  and  choosing  i  when  the  bit  decision  table  is 
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Electhc  fleldj  applied  perpendicular  to  quinotm  wella  ara  obaervad  to  ahlft  that  aanociawd  alactros  cnmaidon 
energica  and  optical  abaorpoon  odgca  of  the  atruccure.  THia  cfTect  U  known  aa  the  quantum  confined  Stark  effect  or 
QCSE.  Although  the  QCSE  haa  been  atudied  extmaively  for  Hl-V  compound  aemJoonductor  ayttama[l  ].  obeesrvadon  of 
the  QCSE  in  Sii-xOex/Si  mulU^uanatm  wella  (MQW)  haa  rarely  been  elt*d[2,3].  Recently,  room  t«nperatare 
photocurrenu  for p-i-n  photodiodee  comalnlng  an  tstrinalc  region  oomprlaed  of  Slf.^Ocs  MQWi  grown  by  lU^ECVD 
(remote  plaama  enhanced  chemical  vapor  depoaltlon)  have  been  rcported[3}.  The  objective  of  thia  work  ia  to  report  the 
observation  of  a  room  temperature  Start  shift  In  these  Sit.xCex/Si  muld-quannan  svella.  The  absorption  edges  eatimaiad 
from  the  measured  phoiocurrent  of  the  reverse  biased  p-i-n  diodes  ihow  Urge,  linear  shifts  of  die  transidon  energiea  of  the 
MQWs  under  applied  electric  field.  The  experitnera^y  obtained  shift  of  the  tranaitian  energies  tnatches  very  well  with 
that  calculated  from  the  envelope  function  approximation  meihod(3]. 

Figure  1  shows  4  schematic  crosa-section  of  the  device  studied  in  this  wort.  MQWs  grown  on  unstrained 
Sio.gGeoJI  buffer  layers  are  used  to  allow  the  strain  to  be  distributed  symmeiricsily  in  Sio.frOeo.4  and  in  Si  Uyers.  This 
allows  thick  MQW  structures  to  be  grown  without  the  critical  thickness  limiution  over  the  entire  structure  thickness.  The 
band  alignment  of  these  structures  is  suggered  type  H  (i.  c.  the  bend  edges  of  the  narrower  band  gap  Sio.6^^.4  lie  above 

the  respective  band  edges  of  the  wider  gap  Si  (Fig.  2)).  Under  applied  electric  field  (c),  the  change  in  transition  energy 
shift  may  be  expressed  as  AE,  +  A////,  ±  ecL  (  &£/  +  A////,  is  the  shift  of  minimum  bound  sute  energies).  The  last 
term  represents  the  potential  drop  between  the  couer  of  two  adjacent  layers  and  dominates  the  expression.  The  total 
energy  shift  is  nearly  linear,  leading  to  large  transidon  shifts{4).  Figure  3  ahowa  (be  measured  photocurrenis  u  a  function 
of  photon  energy  under  different  electric  fields.  To  eliminate  the  effect  of  Increajed  carrier  transport  due  to  increased  biu, 
the  photocurreni  spectra  were  normalized  with  respect  to  the  wavelength  1190mn  (1042meV),  where  any  changes  in  the 
sigrial  with  biu  very  am  all  contribudoiu  from  the  (^CSE.  For  the  same  mcideni  optical  power,  the  photocutrent  (T)  is 
proportional  to  (l-exp<*<zL))/hv,  where  a  is  the  absorpdon  coefficient.  L  la  the  M(^W  tbickneas  and  hv  is  the  photon 
energy.  For  L«l/a,  a  Is  proportional  to  Div.  Plotting  Div  u  a  funedoo  of  hv  (Figure  4),  It  Is  observed  that  near  8(X) 
meV  (the  estimxtcd  tnmsition  energy  of  the  MQW).  o  varies  u  (hv-Ego)*'^  fw  dlffcxervi  applied  electric  fields.  From 
thii  fit,  the  absorption  edge  (Eg^  may  be  eaiimated  at  different  electric  fielda  A  linear  edge  ahift  towards  lower  energiea 
with  Increased  electric  field  may  be  observed  In  Figure  3.  Figure  5  Illustrate  the  good  agreement  found  berween  the 
experimentally  obtained  shift  and  that  calculated  within  the  framework  of  the  envelope  function  approximadon{5], 
neglecting  intervalley  interaciion.  The  btndoffseu  were  cstimaied  from  the  combiniiion  of  iclf-conais«mi  06  initio 
pseudopoientiai  results[6]  and  the  phenomenological  deformiiian  potendal  theory(7].  To  undemand  the  spectra  ai  higher 
energies,  we  have  subtracted  the  (hv-Ego)^^  dependency  from  the  measured  dua  and  obtained  the  absorption  shape  of  the 
Sio.gGeo  2  buffer  layer  (Figure  6).  The  shape  obtained  in  our  work  is  similar  to  that  of  Braunsiein.  et  al.  [81.  It  is 
interesting  to  note  here  that  the  square  root  dependence  of  absorption  coefficient  on  photon  energy  (which  is  incidentally 
characteristic  of  the  bulk  direct  band  gap  material)  is  different  from  that  of  ref.  2.  erfrete  Part  et  al.  observed  a  square 
dependence.  We  believe  (hat  more  experimenul  informadoa  is  needed  to  undenund  ibe  recombinadon  mechanism  in  type 
n  St].xCex/Si  heierosiTUCturcs, 

In  summary,  we  have  demonstrated  large  linear  quantum  confined  Stark  ahift  in  Sii-xCcx/Si  MQWs  at  room 
temperature  which  may  prove  to  be  tueftil  in  optoelectronic  device  applications. 
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ABSTRACT 

In  most  electronic  bandstructure  calculations  of  semiconductor  superlatiiccs  (SLs),  periodically  extended  boundary 
conditions  are  used.  However  the  presence  of  a  terminating  layer,  which  breaks  the  periodicity  of  the  SL.  is  expected  to 
modify  the  electronic  bandstructure.  In  this  work,  we  report  a  model  for  calculating  the  localized  surface-like  states  of  a 
terminated  SL  which  includes,  for  the  first  time,  the  nonparabolic  nature  of  the  host  bandstructurcs  by  way  of  band  mixing, 
unlike  the  simple  one  band  Kxonig-Penney  type  model.  This  model  is  within  the  framework  of  the  muluband  envelope 
function  approximation,  which  has  proven  to  be  a  useful  and  efficient  method  for  the  calculation  of  SL  electronic 
bandstructure. 


1.  INTRODUCTION 

Semiconductor  superlattices  (SLs)  are  obtained  by  growing  alternately,  layers  of  different  host  semiconductors.  In  the  SL, 
the  charge  carriers  see  in  addition  to  the  normal  crystal  potential,  a  periodic  potential  which  is  due  to  a  series  of  quantum 
wells  (QWs)  and  barriers  due  to  the  presence  of  band  offsets  and  energy  gap  differences  at  the  hcierojunction.  When  there  is 
a  considerable  amount  of  overlap  of  wave  functions  between  neighboring  QWs,  bands  of  allowed  energy  levels  of 
delocalized  states  are  formed  with  the  simultaneous  formation  of  forbidden  minigaps  (MGs).  Superlattices  arc  important 
model  systems  to  study  bulk  crystal  properties,  due  to  the  correspondence  between  a  miniband  of  a  SL  and  a  band  of  energy 
of  a  bulk  crystal.  These  artificial  structures  are  currently  of  great  technological  interest  and  importance  because  of  their 
promise  for  flexible  tuning  of  the  electronic  structure  of  the  resulting  system  by  controlled  modifications  of  parameters 
such  as  layer  thicknesses,  alloy  compositions,  strain,  growth  axis  etc. 

In  order  to  understand  the  origin  of  the  fundamental  physical  phenomena  and  other  interesting  optical  and  transport 
properties  the  SLs  exhibit,  it  is  essential  to  calculate  their  electronic  band  structure.  In  most  band  structure  calculations  of 
SLs,  periodically  extended  boundary  conditions  are  generally  used  and  the  SL  is  looked  upon  as  an  infinite,  uniform  periodic 
sriucture,  so  that  it  is  readily  amenable  to  theoretical  treatment.  In  this  picture  there  are  allowed  bands  of  extended  states 
separated  by  forbidden  gaps.  The  analogous  picture  for  real  bulk  solid  is  also  the  same.  But,  in  a  real  crystal,  due  to  the 
presence  of  the  surface,  a  perturbation  of  the  band  suucture  at  the  surface  is  expected.  In  the  1930s  Tamm'  and  Shockley^ 
modeled  the  presence  of  surface  boundaries  in  solids  and  showed  the  presence  of  stales  inside  the  forbidden  gaps  at  the 
surface.  These  were  called  surface  states.  Although  surface  science  is  one  of  the  most  fertile  fields  in  physics,  these  surface 
states  have  not  been  observed  in  the  pure  form  Tamm  and  Shockley  predicted.^  This  was  due  to  the  simultaneous  presence 
of  many  effects  (such  as  the  presence  of  foreign  atoms)  in  real  surfaces  apart  from  the  cases  Tamm  and  Shockley  studied. 
Also  the  deviation  of  the  surface  potential  from  that  in  the  bulk  was  still  unknown.  Similarly,  in  the  semi-infinite  SL 
picture  where  the  periodicity  of  a  SL  is  broken  by  the  addition  of  an  interface,  localized  surface-like  slates  occur  at  the 
interface.  Recendy  Ohno^  et  al.  have  experimentally  verified  the  existence  of  such  surface  states  by  simulating  an  abrupt 
higher  barrier  internal  "surface"  by  terminating  a  GaAs/AlGaAs  SL  with  an  AlAs  barrier.  They  observed,  for  the  first  time, 
the  presence  of  localized  "Tamm"-like  "surface"  states  (called  SL  surface  states)  inside  the  minigaps. 
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We  believe  superlaitices  can  be  an  excellent  experimental  tool  to  study  different  surface-like  behavior  (  and  to  verify 
different  theories  of  surface  states  )  in  a  periodic  structure  such  as  a  semiconductor  crystal.  This  is  due  to  the  fact  that  SLs 
allow  the  construction  of  an  arbitrary  model  surface  for  a  particular  study.  As  an  example,  the  GuAs/AIGaAs  system 
allows  one  to  realize  and  manipulate  a  wide  range  uf  one  dimensional  potential  profiles  to  simulate  a  inudcl  surface  in  a 
controlled  manner. 


2.  MODEL 

Recently  Steslicka  et  al.^  have  proposed  a  simple  model  to  calculate  these  SL  surface  states.  This  model  is  an  extension  of 
the  one  band  effective  mass  type  method  in  which  the  bandsiructurcs  of  the  host  materials  arc  taken  to  be  parabolic. 
However  this  approach  fails  to  describe  even  the  band  structure  of  infinitely  long  InAs-GaSb  SL^  in  which  the  valence  bund 
maximum  of  GaSb  is  higher  than  the  conduction  band  minimum  of  InAs.  Even  for  wide  band  gap  GoAs-AIGaAs  material 
systems,  the  inclusion  of  band  nonpsuabolicity  has  been  found  to  be  the  primary  reason  for  improved  agreement  with  the 
experimental  results.^  Therefore  for  accurate  determination  of  energy  levels,  band  mixing  i.c.,  band  nonp  jraboliciiy  should 
be  taken  into  account.  In  this  work,  we  report  the  calculation  of  the  SL  surface  states  of  a  terminated  SL  by  including  the 
nonparabolic  nature  of  the  host  bandstructures  by  way  of  band  mixing.  We  found  that  the  results  differ  significantly  from 
those  using  the  simple  parabolic  band  model.  The  approach  used  here  is  similar  to  that  of  the  one  band  model  but  the 
Hamiltonian  and  the  boundary  conditions  used  as  well  as  the  final  results  obtained  are  substantially  different  as  will  be 
shown  below. 


Let  us  suppose  that  a  SL  is  composed  of  host  materials  A  and  B  of  thicknesses  L^.  and  Lb  stacked  aStcrnaicly  one  on  the 
other  and  the  terminated  well  region  is  ended  by  a  higher  potential  (region  C).  perhaps  representing  the  vacuum  region 
(Figure  1).  Since  the  mulliband  Kane  model^  incorporates  both  the  conduction  and  the  valence  bands  taking  into  account 
the  band  mixing,  it  is  most  useful  for  describing  the  band  structure  of  the  host  materials.  This  method  has  been  found  to 
give  more  accurate  match  with  experimental  results  than  any  other  similar  methods  which  neglect  the  band 
nonparabolicity.^  Using  this  Hamiltonian  (i.e.,  following  Bastard’s  envelope  function  approximation  formalism),  the  SL 
dispersion  relationship  is  given  by 

cos(9<l)  =  co8{kAL^)cos{kBLB)  -  ^)ein{k^L^)sin{kBLB) 


E  -  Vs.  = 


ki^P^  r  2  It 

3  lE-Vp,'^  E-Vo.i 


i  =  A,B,C 


(1) 


kA 


f  2  1  If 

\E-Vpj,  £-Ko^il 

E  —  VpB  E  —  VoB  1 

-1 


d  —  La  +  Lc 

BCMi  =  „  „  ~  i  =  A,B.C 

E  -Vp.  E-  Vo, 

where  q  is  the  SL  wave  vector,  mj  is  the  band  edge  effective  mass,  Vsj,  Vpi  and  Voi  are  the  conduction  band  minimum, 
light  hole  band  maximum  and  the  split-off  band  maximum  respectively  of  the  i  layer  (i=A,B,C),  P  is  the  momentum 
matrix  element  and  E  is  the  elecuon  energy.  This  method  is  a  generalization  of  the  one  band  parabolic  Kronig-Penney  type 
model  applied  to  a  multiband  case.  The  boundary  condition  used  are  the  continuity  of  the  envelope  function  and  the  first 
derivative  of  the  envelope  function  divided  by  BCMj  at  a  hetero-interface.  The  last  boundary  condition  reduces  for  the  case 

of  one  band  parabolic  model  to  the  first  derivative  of  the  envelope  function  divided  by  the  effective  mass  at  the  band  extrema 
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point.  When  a  .SL  with  cyclic  boundary  conditions  is  considered,  we  can  apply  on  the  envelope  functions  cl>(z),  the  Bloch 
condition  tt)i'z+id>=cxp(iqjd)cl>(z).  Since  the  envelope  function.^  must  describe  an  electron  having  equal  probability  of  being 
found  in  any  quantum  well  of  the  SL,  all  the  components  of  q  have  to  be  real.  But  when  the  SL  is  terminated,  the  complex 
values  of  the  SL  wave  vector  q  may  describe  slates  that  arc  physically  possible.  The  envelope  function  of  this  type 

decreases  exponentially  as  one  moves  away  from  the  terminating  interface  towards  the  SL,  and  therefore  corresponds  to  a 
localized  Slate.  Suppose  in  the  nth  MG  q  is  given  by  i  qj  +  n  rc/d  (  Qj  >  0.  n=0,l  ,2,3, . ).  For  complex  q,  the  L.H.S 

of  (1)  becomes  (-1)'’  cosh  qjd. 

From  the  transfer  matrix  approach*  and  the  Bloch  theorem,  the  wave  function  at  the  first  well  layer  is  known  apart  from  a 
normalization  coefficient.  Assuming  an  exponentially  decaying  function  in  region  C  (higher  barrier)  and  applying  the  above 
mentioned  boundary  conditions  at  the  interface  of  the  terminated  well  layer  and  region  C,  we  obtain: 


{-l)"exp(-qid) 


sinjkALA)  +  ClcosjkALA) 

C\cosh{kB I^b)  ~  C2sinh{kB Lb) 


kA*iBCMA}  kA*{BCMA) 

kc  *  {BCMc)  '  ks  •  (BCMb) 


(2) 


Now  eliminating  qj  from  (1 )  and  (2)  gives  us  the  surface  states  in  the  nth  MG.  This  solution  depends  on  the  depth  and  the 
width  of  the  QW's  and  on  the  effective  mas.ses  at  the  extrema  points  of  the  bulk  band  structures  of  the  host  materials. 

3.  RESULTS 

Figure  2  shows  the  calculated  energy  levels  of  InAs-GaSb  SL  as  a  function  of  the  SL  period.  Hatched  regions  indicate  the 
allowed  energy  bands  as  was  calculated  in  the  now  classic  paper  by  Bastard.^  The  important  result  is  the  presence  of  the  SL 
surface  suites  inside  the  MGs.  Etich  MG  has  two  energy  levels  due  to  the  presence  of  a  terminating  layer  (taken  as  vacuum). 
The  shape  of  these  SL  surface  states  closely  follow  those  of  the  allowed  energy  bands.  The  values  of  llie  input  parameters 
used  in  this  calculation  were  taken  from^  and  the  value  of  V^c  used  was  4.55  eV.  To  make  comparison  with  the  earlier 
report. wc  have  also  calculated  the  SL  surface  states  of  a  terminated  GaAs-Al^Gaj.xAs  SL  as  a  function  of  the  mole 
fraction  x  as  shown  in  Figure  3.  The  barrier  height  of  the  internal  quantum  wells  changes  as  the  mole  fraction  is  changed. 
As  before  the  hatched  regions  indicate  the  allowed  energy  bands  and  the  dotted  lines,  the  surface  states.  If  these  results  are 
compared  w  ith  those  calculated  from  the  simple  one  band  parabolic  model,  we  notice  a  downward  shift  of  the  allowed 
energy  bands  as  expected,®  and  consequently  a  shift  of  the  SL  surface  stales.  In  addition  to  this,  there  are  two  energy  levels 
found  from  the  nonparabolic  model  as  opposed  to  one  found  from  the  parabolic  model.  In  Figure  4,  the  effect  of  the 
variation  of  the  barrier  width  (keeping  the  well  width  constant)  and  well  width  (barrier  width  constant)  is  shown.  In  both  the 
casc.s,  the  presence  of  the  SL  surface  states  are  shown  inside  the  MGs.  In  Figure  5,  a  comparison  is  made  between  the  SL 
surface  states  calculated  from  a  one  band  parabolic  model  and  the  muliiband  nonparabolic  model.  The  result  clearly  shows 
that  the  inclusion  of  nonparaboliciiy  significantly  affects  the  energy  level  positions  of  the  SL  surface  states  particularly  at 
the  higher  lying  minigaps.  The  terminating  barrier  height  Vjc  was  taken  as  4.07  eV  for  GaAs-Al^Gaj.jjAs  SLs  in  all  the 

above  cases.  In  Figure  6,  the  effect  of  changing  the  terminating  barrier  height  on  the  SL  surface  states  are  shown.  For 
terminating  barrier  heights  greater  than  the  internal  barrier  height,  the  SL  surface  states  arc  found  above  the  minibands  and 
for  terminating  barrier  heights  less  than  that  of  the  internal  barriers  the  surface  states  are  found  below  the  minibands,  the 
details  of  which  will  be  published  elsewhere.  It  should  be  mentioned  here  that  this  result  is  expected  from  the  tight-binding 
picture.^  In  this  picture,  the  isolated  quantum  well  energy  levels  broaden  into  a  miniband  as  the  coupling  between  the  wells 
is  turned  on.  Rut  the  end  quantum  w'cll  state  which  has  a  different  energy  level  from  that  of  the  internal  quantum  wells  (due 
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to  the  difference  in  barrier  heights)  does  not  participate  in  the  formation  of  the  minibands  and  it  becomes  a  surface  state.  So, 
for  terminating  barrier  higher  (lower)  than  the  internal  quantum  well  barrier,  the  surface  state  is  at  higher  (lower)  energy 
than  the  minibands.  Interestingly,  we  found  that  one  of  the  SL  surface  states  in  each  MG  does  not  vary  with  the  variation 
of  the  terminating  potential.  The  other  SL  surface  state  disappears  as  the  height  of  the  terminating  potential  approaches  that 
of  the  internal  barriers.  In  all  these  calculations,  a  two  band  model  using  the  conduction  band  and  the  light  hole  valence 
bands  were  used  and  only  the  electron  energy  levels  are  shown  in  the  figures. 

4.  CONCLUSIONS 

In  summary,  we  have  extended  the  envelope  function  approximation  model  to  calculate  the  SL  surface  electronic  states  of  a 
terminated  SL  to  the  case  of  multiband  Hamiltonian  to  incorporate  the  nonparabolic  nature  of  the  band  strucoire  of  the  host 
materials.  The  surface  electronic  states  derived  from  the  multiband  nonparabolic  band  structure  model  depart  significantly 
from  those  reported  for  the  one  band  parabolic  model.  Also,  two  SL  surface  states  are  obtained  from  the  nonparabolic  model 
rather  than  one  found  from  the  parabolic  band  model.  Therefore  band  mixing  is  important  for  these  SL  surface  states. 
Although,  only  InAs-GaSb  and  GaAs-AlGaAs  SLs  have  been  considered  here,  this  model  can  be  applied  to  any  material 
system  such  as  lattice  matched  Gao.avIno.s^As-InP  SL  terminated  by  Alo.48Jno.52As  barrier.  This  method  can  also  be 
extended  for  the  case  of  SLs  composed  of  complicated  bases  ( triangular,  parabolic,  polytype^  etc.).  We  believe  that  this 
model  should  be  useful  for  accurate  modeling  of  transport  across  a  terminated  SL  and  may  aid  in  the  understanding  and 
studying  of  surface  and  interface  physics. 
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and  L|)  are  the  tnickne&ses  of  region  A  and  B  respectively.  ' 
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Fig.  3:  The  energy  levels  of  a  terminated  G^s-AIxCai.^As 
superlaltice  with  well  widths60  A  and  barrier  widthslS  A. 
The  hatched  regions  are  the  minibands  and  the  dotted  lines 
correspond  to  the  SL  surface  states.  The  origin  of  energy  is 
taken  at  the  conduction  band  minimum  of  QaAs. 


Fig.4:  The  allowed  energy  levels  of  a  terminated  GaAs-AlxCaj.xAs  SL  (a)  as  a  function  of  well  width  with  the 
barrier  width-31  A  and  (b)  as  a  function  of  barrier  width  with  well  width*  130  A.  The  hatched  regions  arc  the 
minibands  and  the  dotted  lines  correspond  to  the  SL  surface  states.  The  origin  of  energy  is  taken  at  the  conduction 
band  minimum  of  CaAs. 
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Fig.  6:  The  allowed  energy  levels  of  a  lerminated  GaAs- 
AlxGaj.xAs  SL  as  a  function  of  the  interface  barrier  height 
at  the  end  quantum  well.  The  well  width  was  taken  as  130  A 
and  the  barrier  width  as  40  A.  The  hatched  regions  arc  tire 
minibands  and  the  dotted  lines  correspond  to  the  SL  surface 
states.  The  origin  of  energy  is  taken  at  the  conduction  band 
minimum  of  GaAs. 


Fig.  5:  The  difference  in  energy  levels  of  the  SL  surface 
states  calculated  using  the  nonparabolic  and  dw  parabolic 
band  model  for  a  GaAs*AlxGai.xAs  SL  terminated  in 
vacuum  as  a  function  of  the  composition  of  AlxCaj.^As. 
1(a)  is  for  the  lower  lying  state  and  1(b)  is  for  the  higher 
lying  state  in  the  first  minigap.  Tlje  corresponding  states  arc 
also  shown  for  the  second  minigap. 


Barrier  height  Vse  at  the  end  quantum  well  (meV) 
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ABSTRACT 


We  have  developed  a  femtosecond  ellipsometer  by  incorporating  ellipsomctric  probe 
optics  into  a  rapid  scan  femtosecond  pump-and-probe  experiment.  The  system  allows  near 
real-time  display  of  the  photo-induced  reflectivity  changes  and  provides  complete 
characterization  of  the  time-varying  dielectric  function.  This  ellipsometer  is  u.sed  ex  situ  to 
characterize  the  femtosecond  respon.se  of  relaxed.  MBE-grown  Si^^Gei.^  alloys  over  the 
complete  composition  range.  The  results  show  that  the  femtoscomd  response  depends 
strongly  oil  alloy  composition  in  optically  thick  samples.  Ge-like  samfdes  (x<0.37)  show  a 
characteristic  two-component  response  which  may  be  caused  by  intervalley  L  P  hole 
scattering  and  impact  ionization.  For  a  given  alloy  composition,  the  picseiKe  of  interfacial 
strain  or  surface  oxidation  strongly  alter  the  femtosecond  re.sponse. 


INTRODUCTION 


Noncontact  optical  probing  methods,  such  as  cw  ellipsoiretry,  are  being  u.sed 
iiKreasingly  for  in  situ  characterization  and  control  of  epitaxial  semiconductor  growth  in  non- 
ultrahigh  vacuum  (UHV)  environments,  where  electron  diffraction  cannot  be  used(lj.  An  in 
situ  monitor  of  the  femtosecond  optical  response  of  semiconductors  could  enrich  the 
diagnostic  power  of  optical  probes,  sitKC  the  ultrafast  relaxation  of  photogenerated  carriers 
can  depend  strongly  on  process-induced  defects,  interface  roughne.ss,  surface  Fermi  level 
piruiing.  and  other  surface  conditions  which  ari.se  during  growth|2|.  Nevertheles.s  long  data 
acquisition  times  have  limited  its  development  as  an  in  situ  diagnostic.  Moreover,  for  some 
important  electronic  materials  such  as  the  SixGe^.;^  alloys,  very  few  quantitative  experiments 
have  measured  the  femtosecond  response  over  the  complete  composition  range,  even  for  ex 
situ  samples. 

In  this  paper  we  present  ex  situ  measurements  of  the  femtosecond  response  of  MBE- 
grown.  optically  thick,  relaxed  epilayers  over  the  complete  alloy  composition  range,  using  an 
unampliTied  colliding  pulse  mode-locked  iCPM)  dye  laser.  In  addition  we  attempt  to 
understand  qualitatively  how  these  responses  arise  from  underlying  bulk  carrier  dynamics. 
Finally  we  show  how  interfacial  strain  within  the  optical  probe  depth  or  surface  oxidation 
radically  alter  the  character  of  the  femtosecond  response.  In  order  to  achieve  complete, 
sensitive  and  rapid  optical  characterization  in  these  measurements,  we  have  developed  a 
prototype  femtosecond  ellipsometer  that  monitors  probe  pulses  reflected  obliquely  from  the 
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samples  dirough  eUipsometric  optics  following  exciiation  of  electron-hole  paiit  by  nonnally 
incident  pump  pulses.  Our  system  achieves  nearly  leal-tirne  display  of  the  feimosecond 
response  by  using  a  novel  "rapid  scan’  techniquel3|  in  which  pump-probe  delay  is  r^iidly 
modulated. 


EXPERIMENT 


A  cavity  dumped  output  (-IMHz)  from  an  unamplified  CPM  laser  of  620nm,  80  fs 
pulses,  is  divided  into  strong  pump  (-Ini)  and  weak  probe  beams  The  probe  beam  is  hirthcr 
divided  into  a  reference  beam  and  a  sample  probe  for  subsetfueni  differetHial  detectxm.  The 
probe  beam  goes  through  a  sequeiKe  of  polarizer,  compeasatur  (quarter  waveplate).  sample, 
and  analyzer.  The  tune  delay  between  pump  and  probe  is  introduced  by  a  retroreflector 
anached  ro  a  rapid  shaker  The  pump-induced  reflectivity  change  is  cotuinuously  digitizej 
and  avenged  by  a  fast  AD  converter  interfaced  to  an  IBM  PC.  No  lock-in  detectioo  is  u.sed 
Real  lime  display  of  reflectivity  change  is  possible  for  Ge  rich  alloys,  where  the  reflectivity 
changes  are  largest.  For  Si  rich  alloys,  about  a  minute  is  usually  requited  to  acquire  the 
reflectivity  changes,  depending  on  the  pump  pulse  energy  With  more  powerful  Ti-sapphire 
femtosecond  lasers  now  widely  available,  we  anticipate  even  greater  ease  in  obtaining  real 
tune  data  acquisition.  From  the  results  for  different  settings  of  the  compensator  angle,  time- 
varying  dielearic  constants  are  calculated.  Further  experiment  and  calculation  detail  on  the 
fen^osccotKl  eUipsometric  procedure  wiU  be  discussed  elsewhere. 


RESULTS  AND  DISCUSSION 


Fig.  I  shows  an  example  of  the  mea.suted  femtosecotxl  response  of  a  senes  of  optically 
thick  Si^Gei.x  alloy  samples  ranging  from  pure  Si  to  pure  Ge.  The  SixCej.,  epUayers 
were  grown  by  MBE  on  Si{l00i  substrates  to  thicknesses  of  approximately  1  pm. 
Consequently,  the  Si  substrate  and  the  subsiraie-epUayer  play  a  negligiNe  role  at  our  probe 
wavelengUi  of  blOnro.  The  measurements  shown  were  performed  in  air  immediately  after  the 
samples  were  grown,  using  pump  energy -I lU,  and  a  relatively  .small  probe  incidence  angle 
^®probe  ~  >•  probe  polarization  tuned  ellipsomeiricaliy  lo  pore  linear  s-polarization 

For  pure  Si  (x=l  ,0)  and  the  Si-rich  (x  i  0  5)  alloys,  the  response  for  these  conditions  consists 
of  a  pulsewidth-limited  reflectivity  decrease  upon  phoioexciiation  (At  =  0)  of  an  electron-hole 
plasma,  foUowed  b;-  a  slow  moootonic  recovery  governed  by  ambipotar  diffusion,  electron- 
phonon  interaction,  arxl  bulk  and  surface  recombinalion(2|.  The  magnitude  of  the  initial 
reflectivity  drop  increases  from  ARyR  -  lO'^  at  x=l  to  AR/R  -  10"^  at  x=fl  5,  consistent  with 
the  increasing  pump  absorption  coefficient.  However,  the  temporal  recovery  pattern  changes 
very  iinie  over  this  composition  range.  The  pump  injects  carriers  erntrely  via  the  indirect  L 
— »  X  valley  transition  in  this  composition  range.  Consequently,  electrons  relax  lo  the  X  valley 
conduction  band  (CBl  minimum,  with  negligible  iniervalley  transfer  of  either  electrons  or 
holes. 

Af  Ge  content  mcreases  further  (x<0.^).  major  changes  in  the  femtosecond  response 
become  evident.  First,  the  initial  signal  magnitude  begins  to  increase  with  Oe  content  ( I  -x)  at 
a  much  faster  rate  than  at  lower  Ge  toncemrations.  This  effect  correlates  with  the  transition 
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Rg.l  FsmIOMoocKi  reflectivity  response  of  the  Si^Qe^.^  aloys.  The  numbers  represent  the  Si 
oontervt  in  each  aloy.  The  reflectivities  at  flmedalay  zero  are  dispiaoed  lor  ctarity. 


frcnn  Si-tike  band  siiuctuie  <X-vaUey  CB  minimum)  to  Ge-like  band  structure  (L  valley  CB 
minimum),  which  brings  about  a  rapid  increase  in  the  pump  absorption  coefficient  a  at  2.0 
eV.  Secondly,  at  the  same  crossover  composition,  a  two-component  picosecond  refleaivity 
response  appears,  evident  as  a  delayed  fall  time  in  the  s=0.37  sample,  and  as  a  prominent 
secondary  reflectivity  minimum  for  pure  Ge.  The  correlation  of  this  effect  with  the  transition 
to  Ge-Iike  band  structure  suggests  that  the  two  must  be  related. 

We  believe  that  the  two-component  response  occurs  because,  as  Ge  content  approaches 
unity,  direct  transitions  across  the  L  valley  increasingly  dominate  the  pump  absorption, 
resulting  in  a  large  concentration  of  electrons  and  holes  around  the  L  point.  For  electrons, 
this  is  the  CB  minimum.  Consequently,  little  further  electron  dynamics  are  expected  in  the 
first  few  picoseconds.  L-point  holes,  on  the  other  hand,  carry  excess  >  I  eV,  which  exceeds 
the  band  gap.  They  can  relax  via  two  processes:  1 )  impact  ionization,  in  which  the  L-point 
hole  scatters  to  f  via  creation  of  an  electron-hole  pair,  and  2)  hole-phonon  scattering,  causing 
transfer  of  the  hole  from  L  to  F  without  creating  an  additional  electron-hole  pair.  Both 
processes  may  contribute  to  the  delayed  reflectivity  decrease  observed  in  Ge-like  samples. 
Impact  ionization  is  a  source  of  delayed,  secondary  carrier  generation.  Furthermofe.  the  L  -t 
r  hole  transfer  decreases  the  hole  effective  mass  fti],*  which  enhances  the  Drude  term 
4ilNe2/m(,*  in  the  evolving  dielectric  function,  thus  also  contributing  to  a  delayed  reflectivity 
decrease. 


b) 


Time  Delay  (fsee) 


Ftg.Z  Enainptes  o>  oWique  incklant  e*p3omelric  probe  meeswament  usipa  pure  Ga  as  a  aample. 

a) Pun\p  induced  refiectivity  changes  foe  dHtefem  compensator  angles.  The  angles  next  to  each  curve 
represent  the  compensator  angle  settings.  Measured  resulis  are  shoiwn  as  sobd  curves.  Calcutaled 
responses  using  the  lime-resoNed  dtolectric  constants  in  b)  are  shown  as  doited  curves.  Probe 
incideni  angle  was  67°.  Polarizer  and  analyzer  were  set  at  0°  and  45°.  respeeSvely. 

b) Caloutaled<Mac«iccanstantchangausingtheresultsorcompensatorangle  at 30°  and70°ina). 


Fig.2a)  shows  an  example  of  complete  characterization  of  the  femtosecortol  response  of 
pure  Ge  using  approximately  the  same  pump  energy  as  in  Fig.l.  but  a  more  oblique  probe 
incident  angle  <0probe  =  *  variety  of  ellipsometric  settings.  For  the  measured 

lespoases  (  solid  line  )  .shown  the  polarizer  and  analyzer  were  set  at  0°  and  45°,  respectively, 
and  the  compensator  angle  was  varied  as  indicaied  in  the  figure  whose  definition  can  be  found 
in  reference  |6|.  The  pumping  condition  is  the  same  for  all  of  the  responses.  For  a  uniform, 
optically  thick  sample,  any  two  of  dtese  responses  can  be  used  to  determine  uniquely  both  the 
real  Atilt)  and  imaginary  A£2(t)  pans  of  the  evolving  dielectric  constant  at  the  probe 
wavelength.  Fig.2b)  shows  A£|(t)  and  A£2(t)  extracted  from  the  measured  responses  at 
^compensator  =  30°  and  70°.  Oearly  the  real  and  imaginary  components  undergo  changes  of 
comparable  magnitude,  showing  that  the  Drude  dielectric  term  from  the  electron-hole  plasma 
and  the  interbsid  absorption  term  both  contribute  strongly  to  the  observed  response,  as 
expected  for  above  gap  probe  {diotons.  The  extracted  AE|(t)  and  Ae2(l)  can  be  used  to 
calculate  the  response  for  other  ellipsometer  settings  as  a  check  of  internal  consistency.  The 
dashed  curves  in  Fig. 2a)  represent  such  calculations,  and  demonstrate  a  high  degree  of 
internal  consistency  in  the  data. 

Fig.3  shows  several  examples  of  the  sensitivity  of  the  femtosecond  response  of 
SijjGei.x  to  sample  conditions.  Curve  a)  in  Fig.3  shows  the  measured  response  of  a  strained, 
optically  thin  (8(him  thick)  sample  of  Siq  75000  25  grown  by  MBE  on  $i(IOO|  under  the 
same  conditions  and  in  the  same  r>— -iber,  as  the  thicker  samples  described  above. 
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Fig.3  ReHedivity  responses  of  strained  and  oxidized  sampies.  a)MBE-gro¥»n  SOnm  thick 
Sig  7sGeQ  25  b)RPCVO-grown  SOnm  thick  Sig  gsOeg^ig.  c)  Sg  5Geo  5  after  exposed  to  ambient  air 
for  severat  months.  The  inset  shovrs  tie  Auger  d^th  prdNIe  analysis  result.  The  btack  dots  represent 
tie  ratio  of  Qe  to  Si  Auger  signal. 


The  recovery  time  is  dramatically  shortened  compared  to  the  relaxed  sample.  Undoubtedly 
this  effect  is  related  to  the  high  defect  density  present  in  the  strained  region  around  the 
Sk).7S^^.25^'  interface,  which  lies  well  within  the  optical  pumping  and  probing  depths. 
The  faster  recovery  can  be  attributed  to  trapping  of  carriers  in  defect  states,  leading  to 
accelerated  recombination.  In  order  to  check  the  reproducibility  of  these  effects,  a  second 
Si(),g5Geg  15  sample  of  the  same  thickness  and  similar  composition  was  grown  on  Sit  l(X)| 
by  a  different  method  -  remote  plasma  chemical  vapor  deposition  (RPCVD).  Curve  b)  in 
Fig.3  shows  that  its  response  is  virtually  indistinguishable  from  that  of  the  MBE-grown 
sample.  This  high  degree  of  reproducibility  suggests  that  the  faster  recovery  is  caused  by 
inherent  features  of  the  strained  interface,  independent  of  details  of  the  growth  procedure  or 
conditions. 

Curve  c)  in  Fig.3  shows  the  measured  femtosecond  response  of  the  Sig  5600  5  .sample 
used  to  acquire  the  Fig.  I  data,  after  the  sample  had  been  exposed  to  ambient  air  at  room 
temperature  for  several  months  Oearly  the  character  of  the  femtosecond  response  has 
changed  completely.  Not  only  has  the  sign  and  magnitude  of  the  initial  reflectivity  response 
changed,  but  a  two-component  response  is  now  clearly  evident,  suggesting  Ge  enrichment 
near  the  surface,  possibly  caused  by  Ge  diffusion  driven  by  the  .surface  oxidation  process|41. 
As  a  lest  of  this  hypothesis,  we  performed  an  Auger  depth  profile  analysis|5|  of  Ge.  Si.  and 
Oxygen  concentration  in  the  oxidized  sample,  shown  in  the  inset.  Enhancements  in  Ge  and 


322 


oxygen  concentration  near  the  surface  are  clearly  evident.  Nevertheless,  we  canttot  at  present 
rule  out  the  possibility  that  the  high  surface  Ge  concentration  was  [Hesent  in  the  as-grown 
sample. 

These  exan^>les  illustrate  that  subtle  changes  in  sample  propenies  aometimes 
significantly  alter  dw  femtosecond  response  of  SijGei.^  samples,  thus  demonstrating  the 
possible  diagnostic  power  of  a  femtosecond  ellipsometer. 
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TIte  new  plasma  diagnostic  technique  of  spectral  btueshifting  of  femtosecond  pulses  is  used  for  the 
first  time  to  analyze  quantitatively  the  ionization  of  noble  gases  under  the  influence  of  intense,  fem¬ 
tosecond  illumination.  The  two  processes  of  strong-field  tunneling  ionization  and  electron  impact  ioniza¬ 
tion  are  found  to  play  competing  roles  on  these  time  scales. 


PACS  numbers:  52.40.Nk.  J2.S0.-l.  J4.80.Dp.  S2.2S.Jm 


Highly  ionized  plasmas  approaching  atmospheric  den¬ 
sity  are  a  potential  future  source  of  coherent  x  rays  (I), 
and  a  potential  medium  for  charged-particle  acceleration 
12}.  Ionization  by  intense,  femtosecond  pulses  holds 
promise  for  precise  control  of  initial  plasma  conditions 
(temperature,  ionization  state,  density)  [3]  which  are 
critical  to  these  applications.  At  the  same  time,  new. 
quantitative,  experimental  diagnostics  compatible  with 
the  high  gas  density  and  ultrafast  time  scale  are  needed 
to  measure  the  ionization  and  subsequent  plasma  dynam¬ 
ics  which  give  rise  to  these  conditions.  A  number  of  au¬ 
thors  have  shown,  both  experimentally  and  theoretically, 
that  a  laser  pulse  which  rapidly  ionizes  a  gas  experiences 
a  frequency  blueshift  caused  by  the  creation  of  a  free- 
electron  plasma  (4,Sl: 


C  •'0  dt 


(1) 


Here,  too  is  the  angular  frequency  of  the  light,  /  is  the 
longitudinal  distance  over  which  the  interaction  occurs; 
n  — (1  ~a/p/a>S)'^'  is  the  index  of  refraction  of  the  medi¬ 
um  through  which  the  pulse  travels,  and  is  found  using 
the  Drude  model  (otp  is  the  plasma  frequency.)  In  exper¬ 
iments  which  used  pulse  durations  long  with  respect  to 
ion-ion  collision  times  (4],  plasma  expansion  and  recom¬ 
bination  following  the  ionization  contributed  to  and  com¬ 
plicated  the  phase  modulation  induced  on  the  pulse.  In  a 
previous  publication  (6l.  we  showed  for  the  first  time  that 
femtosecond  pulses,  tightly  focused  to  intensities  above 
the  ionization  threshold,  experience  a  “pure”  blueshift, 
i.c.,  with  no  trace  of  components  redshifted  from  the  orig¬ 
inal  pulse  spectrum,  indicating  qualitatively  that  the 
phase  modulation  on  the  laser  pulse  is  caused  entirely  by 
ionization.  Furthermore,  complicating  nonlinear  optical 
interactions  with  neutral  gas,  which  can  induce  spectral 
broadening  (supercontinuum)  and  self-focusing  on  fem¬ 
tosecond  pulses  focused  more  loosely  (below  the  ioniza¬ 
tion  threshold)  in  dense  (p  >  40  atm)  gases  l7l,  were 
suppressed  at  near  atmospheric  pressures  because  of  the 
lower  density  and  rapid  plasma  growth  early  in  the  tight¬ 
ly  focused  pulse  [6].  This  preliminary  result  suggested 
that  femtosecond  time-resolved  measurement  and  anal¬ 
ysis  of  the  spectral  shifts  could  measure  detailed  growth 
dynamics  of  the  ionization  front  over  a  broad  range  of  gas 
pressures  and  gas  species,  distinct  from  the  subsequent 


plasma  dynamics  and  optical  nonlinearities  of  the  neutral 
gas.  In  a  wider  context,  such  frequency  upshifts  might 
also  quantitatively  diagnose  plasma  density  oscillations  in 
plasma-based  particle  accelerator  schemes  l8]. 

The  purpose  of  this  Letter  is  to  demonstrate,  for  the 
first  time,  the  quantitative  use  of  this  plasma  diagnostic 
technique  in  analyzing  the  breakdown  of  noble  gases 
caused  by  intense,  femtosecond  illumination.  This  will  be 
done  in  three  stages;  ( I )  We  report  systematic  observa¬ 
tions  of  the  self-blueshifting  of  laser  pulses  after  ionizing 
l-S-atm  pressure  samples  of  He,  Ne,  Ar,  Kr,  and  Xe, 
which  reveal  a  universal,  reproducible  pattern  in  the 
shape  of  the  blueshifted  spectra.  Specifically,  with  in¬ 
creasing  laser  intensity,  gas  pressure,  and  atomic  number, 
the  self-blueshifted  spectra  develop  from  a  near  replica  of 
the  incident  pulse  spectrum  into  a  complex  structure  con¬ 
sisting  of  two  spectral  peaks:  a  narrow  peak  shifted  be¬ 
tween  S  and  10  nm  towards  the  blue  from  the  original 
spectrum,  and  a  broad  peak  shifted  further  towards  the 
blue  whose  position  and  width  depend  strongly  on  the  gas 
pressure,  gas  species,  and  the  laser-pulse  energy.  (2)  We 
report  time-resolved  spectral  shifts  of  a  weak  probing 
pulse  which  show  different  temporal  evolution  for  each  of 
these  two  spectral  features.  (3)  Finally,  we  propose  a 
quantitative,  ab  initio  model  which  relates  these  two  spec¬ 
tral  features  to  two  competing  ionization  mechanisms: 
colliskmless  tunneling  ionization,  predicted  to  dominate 
early  in  the  ionizing  pulse  profile,  and  electron-impact 
ionization,  predicted  to  dominate  in  the  maximum  of  the 
pulse  profile. 

In  our  experiments,  lOO-fs  laser  pulses  with  center 
wavelength  of  620  nm  and  energies  up  to  0.4  mJ  arc  fo¬ 
cused  at  //5  to  a  peak  intensity  of  lo'*-'”  W/cm^  into  a 
glass  cell  containing  1-5  atm  of  He.  Ne.  Ar,  Kr,  or  Xe. 
All  of  the  light  transmitted  through  the  focal  region  is 
collected  and  analyzed  by  a  spectrometer.  Typically,  less 
than  1%  of  the  pulse  energy  is  lost  in  ionizing  the  gas. 
Autocorrelation  measurements  [6(b)l  after  the  interac¬ 
tion  region  show  only  ~10%  temporal  broadening.  Fig¬ 
ure  1(a)  shows  the  measured  spectrum  of  the  ionizing 
pulse  after  breakdown  in  S  atm  Kr.  As  pulse  energy  in¬ 
creases,  the  spectrum  blueshifts  with  little  change  in 
shape  until  the  pulse  energy  reaches  0. 1  mJ  (logio(inten- 
sity)  ■•15.01,  after  which  the  position  of  the  peak  changes 
very  little.  As  the  energy  increases  further,  a  shoulder 
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PIG.  I.  (a)  Spectra  after  interuction  with  5-atm  pressure  Kr 
as  a  function  of  pulse  energy.  Pulse  energy  is  increasing  to¬ 
wards  the  bottom  in  steps  of  x  |0“^  (b)  Spectra  after  interac¬ 
tion  of  0.2S-mJ  pulses  in  5-atm  pressure  of  each  of  the  noble 
gases  studied. 

appears  on  the  blue  side  of  the  spectrum,  then  broadens 
and  shifts  further  towards  the  blue.  Figure  Kb)  shows 
self-shifted  spectra  after  ionization  of  5  atm  of  each  of 
the  noble  gases  using  0.25-mJ  pulses.  The  center  of  the 
unshifted  pulse  spectrum  is  indicated  by  a  vertical  line, 
showing  that  a  blueshift  occurs  in  all  cases.  However,  the 
shape  of  the  blueshifted  spectrum  depends  strongly  on  gas 
species.  In  Ar,  Kr,  and  Xe.  the  narrow,  less-shifted  peak 
and  the  broad,  blue  shoulder  are  clearly  discernible,  the 
latter  becoming  a  separate  peak  in  Kr  and  Xe.  In  He  and 
Ne.  the  blue  shoulder  is  absent,  and  a  different  feature 
—  a  “red  shoulder”  corresponding  approximately  to  the 
unshifted  spectrum — appears  shov/ing  that  some  of  the 
initial  pulse  energy  remains  unshifted.  Although  the  am¬ 
plitude  and  width  of  each  of  these  spectral  features  de¬ 
pend  on  the  focal  profile,  chirp,  and  other  details  of  the 
ionizing  pulse,  numerous  measurements  have  confirmed 
that  the  qualitative  trends  shown  in  Fig.  I  and  described 
above  are  universal,  reproducible  features  of  femtosecond 
ionization  of  the  noble  gases.  Modest  defocusing  of  the 
transmitted  pulse  and  a  visible  breakdown  “spark”  (i.e., 
recombination  luminescence)  always  accompany,  and  are 
precisely  correlated  with,  the  onset  of  the  blueshift.  con¬ 
sistent  with  their  common  origin  in  the  rapid  formation  of 
a  reduced  index  plasma  [6]. 

Time-resolved  pump-probe  experiments  show  that  each 
of  these  spectral  features  also  has  a  characteristic  tem¬ 
poral  evolution  within  the  pump  pulse.  To  obtain  time- 
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FIG.  2.  (a)  Time-resolved  spectra  in  krypton  ai  5  atm  using 
pump  pulse  energy  of  0.22  mJ.  There  are  27  fs  between  spec¬ 
tra:  lime  increases  towards  ihe  top  of  the  figure,  with  negative 
limes  corresponding  to  Ihe  probe  pulse  arriving  before  Ihe  pump 
pulse.  Coincidence  of  the  pump  and  probe  pulses  is  indicated 
by  an  emboldened  spectrum,  (b)  Energies  at  600,  610,  and  61 S 
nm  as  functions  of  time  for  time-resolved  spectra  in  5  atm  Kr. 

resolved  blueshifted  spectra  a  weak  probe  pulse,  derived 
from  the  pump  with  a  beamsplitter,  was  polarized  orthog¬ 
onally  to  the  ionizing  pulse,  then  copropagated  through 
the  focal  region  before  (Ar  <  0),  coincident  with,  or  after 
(Ar  >  0)  the  pump  pulse.  The  probe  was  separated  from 
the  pump  after  the  interaction  region  by  a  polarization 
analyzer,  and  its  spectrum  recorded  for  different  Af.  Fig¬ 
ure  2(a)  shows  a  series  of  time-resolved  probe  spectra  re¬ 
sulting  from  ionization  of  5  atm  Kr  by  a  pump  pulse  cen¬ 
tered  at  A/  "0.  Coincidence  was  determined  to  within 
±20  fs  by  slightly  turning  the  pump  polarization,  and 
then  adjusting  an  optical  delay  line  to  maximize  the  con¬ 
trast  of  the  resulting  pump-probe  interference  fringes. 
The  probe  spectra  around  Ar  ■•0  exhibit  the  same  dual 
structure  seen  in  the  self-shifted  spectrum  (Fig.  1(a)]. 
However,  close  examination  shows  that  the  less-shifted 
peak  and  the  broad  bluer  shoulder  evolve  differently  in 
time.  To  show  this  contrast  quantitatively,  we  have  plot¬ 
ted  in  Fig.  2(b)  the  area  under  the  spectra  in  small  re¬ 
gions  around  615,  610,  and  600  nm.  The  values  at  610 
and  600  nm  display  the  same  temporal  behavior,  with 
maxima  at  approximately  100  fs  before  the  maximum  of 
the  ionizing  pulse,  indicating  that  the  process  giving  rise 
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to  the  blue  shoulder  occurs  in  the  early  part  of  the  pulse. 
The  values  plotted  for  61 5  nm  show  a  maximum  approxi¬ 
mately  50  fs  after  the  maximum  of  the  pump  pulse,  indi¬ 
cating  that  the  process  causing  the  narrow,  less-shifted 
peak  occurs  near  or  slightly  after  the  maximum  of  the 
ionizing  pulse.  A  similar  time  development  is  observed 
for  the  two  corresponding  blueshifted  features  for  Kr  and 
Ar.  The  single  blueshifted  peak  for  He  and  Ne.  on  the 
other  hand,  evolves  early  in  the  pump  profile,  with  little 
or  no  probe  blueshift  observed  for  A/  >  0.  Again,  quanti¬ 
tative  details  of  the  temporal  evolution  depend  on  focal 
profile,  but  the  qualitative  trends  are  observed  reproduci- 
bly  [9]. 

We  now  propose  a  simple,  ab  initio  ionization  model 
which  accounts  for  (I)  the  two  blueshifted  features  ob¬ 
served  in  Ar,  Kr,  and  Xe,  (2)  their  respective  temporal 
behavior,  and  (3)  the  “unshifted”  portions  of  the  spectra 
and  the  absence  of  the  blue  shoulder  observed  in  He  and 
Nc.  In  the  presence  of  a  strong  light  field,  the  ionization 
of  the  atoms  can  be  modeled  using  the  strong-held  tun¬ 
neling  theory  due  to  Keldysh  or  Ammosov,  Delone,  and 
Krainov  (lOl,  which  yield  similar  results  for  the  lower 
stages  of  ionization.  The  coupled  equations  governing  the 
densities  N,  of  atoms  of  a  particular  ionization  state  i  are 
written  in  the  following  way: 

z/A/ 

-  (/>,  _  ,/V,  _ ,  -  A; )  +  (A^o,  -  ,(V  Y.  - 1  -  /Vrff.  )  ■ 


The  first  parenthetical  term  on  the  right  describes  the 
rate  of  growth  of  the  ith  ionization  stage,  where  Ft  is  the 
probability  per  unit  time  (lOl  for  electrons  to  tunnel  from 
k-times  ionized  parent  ions.  The  second  term  describes 
collisional  ionization  via  electron  impact,  where  A,  is  the 
free-electron  density,  tv  •*  fhe  rms  electron  velocity,  and 
<j  is  the  cross  section  (U]  for  the  process.  Intermediate 
re.sonant  states  can  be  ignored  under  our  conditions  {121. 
The  density  A,(t)  of  free  electrons  and  the  resulting  in¬ 
dex  of  refraction  n(/)  calculated  from  Eq.  (2)  yields  a 
calculated  blueshift  upon  substitution  into  Eq.  (I).  The 
model  using  only  the  tunneling  ionization  terms  F*  ade¬ 
quately  describes  the  observed  blueshifting  in  He  and  Nc. 
Because  tunneling  ionization  occurs  almost  entirely  dur¬ 
ing  the  first  half  of  the  laser  pulse,  only  the  leading  edge 
of  the  pulse  becomes  blueshifted;  the  trailing  part  of  the 
pulse  remains  unshifted.  and  appears  as  the  spectral 
■‘red”  shoulder  observed  in  the  He  and  Ne  gas  break¬ 
down. 

At  Ar.  Kr,  and  Xc  pressures  >  1  atm  and  with  max¬ 
imum  pump  intensity,  the  red  unshifted  shoulder  disap¬ 
pears,  and  the  entire  pulse  spectrum  is  shifted  iFig.  I  (b)l. 
This  observation  implies  that  ionization  occurs  during 
and  after  the  peak  of  the  ionizing  pulse,  and  can  be  ex¬ 
plained  by  including  electron-impact  ionization  [the 
second  parenthetical  term  of  Eq.  (2)1.  Collisional  ioniza¬ 
tion  is  strongest  near  the  peak  of  the  laser  pulse  because 
( I )  the  cross  sections  are  maximal  for  electron  energies  in 


the  range  100-500  eV  [11],  very  close  to  the  electron 
quiver  energy  at  the  peak  (370  eV  for  I  O'*  W/cm^);  (2) 
A,  is  large  due  to  the  strong-field  ionization  early  in  the 
pulse:  and  (3)  the  rms  velocity  r  of  free  electrons  is  maxi¬ 
mal  at  the  peak  of  the  pulse.  Stated  from  a  different 
viewpoint,  for  the  first  few  charge  states,  the  thresholds 
for  semiclassicat  barrier  suppression  ionization  (BSD 
(13)  are  less  than  those  for  collisional  ionization  (defined 
as  the  intensity  at  which  the  quiver  energy  equals  the 
low-field  ionization  energy),  while  for  subsequent  ioniza¬ 
tion  stages  the  collisional  threshold  is  reached  before  the 
barrier  suppression  threshold  [131.  Estimates  of  typical 
ionizing  collision  times  in  both  He  and  Ne.  where  obser¬ 
vations  imply  little  or  no  collisional  effect,  yield  r 


FIG.  3.  (a)  Calculated  self-shifted  spectra  for  ionization  of  5 
atm  Kr  by  lOO-fs  pulses  of  varying  peak  intensities,  using  Am¬ 
mosov  strong-field  and  collisional  ionization  terms,  (b)  Calcu¬ 
lated  srtectra  for  ionization  of  5  atm  Kr  without  collisions  in  the 
model,  (c)  Degree  of  ionization  vs  time  calculated  both  with 
and  without  collisions  in  5-atm  Kr  gas  peak  intensity  of  10’^ 
W/cm'. 
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200  fs.  significantly  longer  than  the  light 
pulse  duration.  In  Ar,  Kr,  and  Xe  at  S-atm  pressure, 
however,  these  collision  times  are  significantly  shorter 
than  the  pulse  duration  (r  <30  fs),  resulting  in  signif¬ 
icant  ionization  near  the  peak  of  the  pulse,  and  a  shifting 
of  the  entire  pulse  spectrum. 

Figure  3(a)  shows  the  calculated  spectra  after  break¬ 
down  of  5  atm  Kr,  for  peak  intensities  ranging  from  10'^ 
to  lO'^  W/cm^  calculated  using  Eq.  (2)  with  collisions 
and  a  simplified  cylindrical  interaction  region  with  con¬ 
stant  transverse  intensity  1 1 4],  Figure  3(b)  shows  the 
spectra  calculated  without  collision  ionization:  Signifi¬ 
cant  unshifted  energy  remains  in  these  spectra  as  peak  in¬ 
tensity  increases  {Fig.  3(b)l.  With  collisions  in  the  mod¬ 
el,  the  entire  pulse  spectrum  is  shifted,  following  the  same 
trend  as  observed  in  the  data.  The  broad,  blue  shoulder 
observed  in  the  data  for  Ar,  Kr,  and  Xe  corresponds  to 
strong-held  ionization  (and  thus  has  the  same  origin  as 
the  blucshifts  observed  in  He  and  Ne),  while  the  narrow, 
less-shifted  peak  corresponds  to  slower  collisional  impact 
ionization.  The  calculated  temporal  behavior  of  Ihe  two 
features  further  corroborates  this  interpretation.  Figure 
3(c)  shows  the  calculated  degree  of  ionization  as  a  func¬ 
tion  of  time  in  5  atm  Kr  illuminated  by  a  100-fs  pulse 
with  a  peak  intensity  of  10“  W/cm^  both  with  and 
without  collisions.  Collisional  ionization,  like  the  smaller 
blueshift  which  it  causes,  occurs  during  and  after  the 
peak  of  the  pulse,  and  is  slower  than  strong-held  ioniza¬ 
tion,  which  causes  the  larger  blueshift  early  in  the  pulse. 
This  temporal  behavior  is  also  observed  in  Xe  at  lower 
pressures,  and  in  5  atm  Ar.  Because  of  the  tight  focus 
and  the  early  onset  of  ionization  in  the  pulse  prohle,  the 
calculated  blueshift  is  negligibly  affected  by  including 
self-phase  modulation  caused  by  02  of  the  neutral  gas, 
ions,  and  electrons,  as  shown  in  detail  elsewhere  (6(b), 
I5|. 

In  conclusion,  the  model  predicts  a  high  rate  of  ioniza¬ 
tion  early  in  the  laser  pulse  due  to  collisionless  strong- 
held  ionization,  followed  by  collisional  ionization  at  a 
lower  rate  near  the  peak  of  the  laser  pulse,  in  good  agree¬ 
ment  with  the  data.  A  more  detailed  report  of  the  experi¬ 
ments  and  model  will  be  published  separately  (16]. 
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ni—  —0.02  for  l-atm  triply  ionized  plasma  exceeds  even 
the  hypothetical  nonlinear  index  nf'‘"''l  which  would  be 
induced  by  light  of  intensity  /“lO'*’  W/cm’  interacting 
with  neutral  He,  Ne.  Kr.  and  Ar  (for  Xe. 
svAn^aufij).  using  measured  values  (Lehmeier  et 

al..  Ref.  I7l).  Taking  info  account  the  disappearance  of 
the  neutral  gas  early  in  the  pulse,  and  the  smaller  «?” 
values  of  the  ions,  nf*/  is  small  compared  to  at 

the  peak  of  our  pulse  in  all  the  noble  gases.  As  for  the 
free  electrons,  nl’"'"'”  becomes  significant  only  for  relativ¬ 
istic  quiver  velocities  (/— 10'“  W/cm’). 

(I6l  Wm.  M.  Wood,  C.  W.  Siders.  and  M.  C,  Downer  (to  be 
published).  Wm.  M.  Wood.  Ph.D,  dissertation.  University 
of  Texas  al  Austin,  1991  (unpublished). 
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Optical  properties  of  liquid  carbon  measured  by  femtosecond  spectroscopy 
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A  comprehensive  report  of  femtosecond  time-resc'ved  reflectivity  and  transmission  of  graphite  and  di¬ 
amond  following  optical  excitation  above  critical  melting  fluences  F*  of  0.13  and  0.63  J/cm',  respective¬ 
ly,  is  presented.  Normal-  and  oblique-incidence  reflectivity  has  been  measured  with  100-fs  resolution  at 
wavelengths  ranging  from  700  to  310  nm.  Within  1  ps  following  excitation  above  F„,  probe  reflectance 
increases  sharply  at  visible  frequencies,  remains  nearly  unchanged  at  near-ultraviolet  frquencies,  and  de¬ 
pends  weakly  on  excitation  fluence.  These  optical  changes  are  interpreted  as  an  ultrafast  melting  transi¬ 
tion  from  cry  stalline  graphite  or  diamond  to  a  common,  more  reflective  liquid  state.  During  the  first  pi¬ 
cosecond  following  excitation,  electron  and  lattice  temperatures  substantially  equilibrate,  and  the  lattice 
melts,  before  heat  conducts  out  of  the  absorbing  volume  or  the  surface  hydrodynamically  expands.  A 
Drude  model  of  the  reflectance  spectrum  1  ps  afler  excitation  reveals  a  strongly  damped  plasma  (plasma 
frequency-relaxation  time  product  <i>pr~l),  in  contrast  to  liquid  silicon  (tijpT'-S).  Inferred  electron 
mean  free  paths  approach  the  average  interatomic  spacing  (2  A),  implying  electron  localization.  Optical¬ 
ly  determined  dc  resistivities  up  to  625±75  ^flcm  agree  with  measurements  at  kilobar  ambient  pressure, 
but  significantly  exceed  resistivities  measured  and  calculated  at  low  pressure.  Thus,  the  attribution 
''metal"  is  questionable  for  fluid  carbon  under  these  conditions.  The  results  demonstrate  that  fem¬ 
tosecond  lasers  can  extend  condensed-matter  thermophysics  measurements  to  temperature-pressure  re¬ 
gimes  inaccessible  by  other  methods. 


I.  INTRODUCTION 

Scientific  debate  over  the  properties  of  the  elusive 
liquid  state  of  carbon,  nature’s  most  refractory  material, 
dates  back  to  Ludwig's  1902  report  of  increased  resis¬ 
tance  in  carbon  rods  melted  at  high  pressure.  Extensive 
recent  research* on  high  temperature-pressure  phases 
of  carbon,  which  has  been  reviewed  by  Bundy, ^  reflects 
not  only  the  wide-ranging  importance  of  the  problem  in 
condensed  matter  physics,^  astrophysics,'*  and  geolo- 
gy,'’~‘‘’  but  the  continuing  controversy  over  the  basic 
properties  of  the  liquid  state.  For  example,  much  recent 
debate  has  focused  on  whether  liquid  carbon  is  metallic 
or  insulating,  and  on  how  its  electrical  conductivity  de¬ 
pends  on  temperature  and  pressure.'* ~ Much  of  the 
controversy  stems  from  the  difficulty  of  creating  and 
studying  a  reproducible  sample  of  liquid  carbon  in  the 
laboratory,  particularly  producing  identical  samples  and 
conditions  by  different  methods.  Since  the  melting  tem¬ 
perature  ( ~  5000  K)  exceeds  that  of  any  containment 
vessel,  and  since  the  solid  sublimes  at  low  ambient  pres¬ 
sure,  long-term  confinement  at  elevated  temperature  and 
pressure  is  impossible.  Consequently  experiments  must 
rely  on  transient  melting  and  probing,  using  methods 
such  as  shock  waves,'-  pulsed  ohmic  heating'*  in  high 
pressure  cells, and  pulsed  laser  heating.^  ’  *” 

Recent  pulsed  laser  melting  experiments' *■'*•'  '  were 
the  focus  of  a  particularly  sharp  disagreement  because  of 
the  apparent  contradiction  in  measured  properties.  On 
the  one  hand,  30-ns  laser  pulses  incident  on  highly  orient¬ 
ed  pyrolytic  graphite  fHOPG)  created  melt  depths  which 
Steinbeck  et  al.'  could  explain  only  by  assuming  large, 
metallic  thermal  conductivities  in  the  melt.  This  result 
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agreed  qualitatively  with  metallic  electrical  resistivity 
(p  —  50±20  fiilcm)  measured  by  microsecond  pulsed 
ohmic  heating  of  pyrolytic  graphite  fibers  at  low  pres¬ 
sure.*  On  the  other  hand,  in  time-resolved  optical 
reflectivity  measurements  of  HOPG  melted  and  probed 
by  20-ps  laser  pulses,  Malvezzi  et  al.  *  observed  decreased 
surface  reflectivity  following  excitation  above  a  critical 
melting  fluence,  which  they  interpreted  as  the  formation 
of  an  insulating  liquid  phase.  This  interpretation  was 
supported  by  an  optoelectronic  switching  experiment’  in 
which  picosecond  laser-melted  HOPG  functioned  as  the 
photoconductive  gap  material  in  a  current  transmission 
circuit,  but  challenged  by  authors*  who  argued  that  ma¬ 
terial  ablation  obscured  the  liquid  surface  within  the  20 
ps  pulse  duration,  causing  a  reflectivity  decrease  unrelat¬ 
ed  to  liquid  carbon. 

In  an  earlier  Rapid  Communication,'**  we  briefly  re¬ 
ported  the  first  femtosecond  time-resolved  reflectivity  ex¬ 
periments  on  HOPG.  The  current  paper  is  a  comprehen¬ 
sive  report  of  these  experiments,  augmented  with  expand¬ 
ed  data  and  analysis.  Femtosecond  time  resolution  pro¬ 
vides  the  ultimate  "inertial  confinement"  of  a  laser- 
melted  sample,  because  there  is  no  time  in  the  first  1-2 
ps,  following  femtosecond  excitation  for  atoms  to  move 
away  from  the  surface.  Thus  the  optical  integrity  of  the 
surface  is  preserved,  and  the  question  of  hydrodynamic 
surface  expansion,  or  ablation,  does  not  arise  during  this 
time  interval.  At  the  same  time,  considerable  evidence 
suggests  that  structural  disordering,  or  melting,  of  the  lat- 
tice*'-^^  and  equlibration  of  electron  and  lattice  tempera¬ 
tures^^  progress  substantially  toward  steady-state  during 
this  time  interval.  Thus  a  critical  time  window  exists 
during  which  the  optically  smooth  surface  of  a  quasi- 

2677  ©1992  The  American  Physical  Sociei> 


2678 


D.  H.  REITZE,  H.  AHN,  AND  M.  C  DOWNER 


43 


Steady-State  liquid  car  be  probed.  Indeed  the  fem¬ 
tosecond  experiment'  showed  sharply  increased 
reflectivity  for  several  picoseconds  following  excitation 
above  the  critical  melting  fluence.  which  decayed  to  the 
much  lower  values  observed  in  ps  experiments"  only  after 
time  delays  of  10  ps  or  more.  This  pattern  closely 
matched  the  reflectivity  response  of  the  much  better  un¬ 
derstood  silicon  when  irradiated  at  several  times  the 
melting  fluence  by  a  femtosecond  pulse,  where  it  is  gen¬ 
erally  attributed  to  ultrafast  melting  into  a  metallic  liquid 
followed  by  the  slower  process  of  surface  expansion. 
Thus  these  earlier  fs  experiments  defined  the  time  scale 
during  which  the  optical  properties  of  the  unobscured 
liquid  should  be  measured  following  melting  by  a  fem¬ 
tosecond  pulse. 

In  this  paper  the  measurement  and  analysis  of  the  opti¬ 
cal  properties  of  femtosecond-laser-melted  carbon  during 
the  initial  increased  reflectivity  stage  is  carried  out 
in  much  greater  detail.  Specifically,  time-resolved 
reflectance  data  at  a  range  of  probe  wavelengths 
(700  >>->310  nm),  oblique  incidence  angles,  and  pump 
fluences  is  reported.  Furthermore  a  dielectric  function 
based  on  the  Drude  model  is  .it  to  the  data.  From  the 
model,  Drude  parameters — conduction  electron  density 
and  collision  frequency  —  are  extracted.  There  are  three 
moiivations  for  such  detail  First,  careful  reflectance 
measurements  have  historically  provided  one  of  the  most 
important  experimental  keys  to  the  underlying  electronic 
properties  of  liquid  metals.'*  Second,  the  recent  appear¬ 
ance  of  molecular  dynamics  simulations"  and  a  density 
functional  theory’  of  the  liquid  state  of  carbon  provide 
testable  predictions  of  optical  properties  which  can  be 
compared  directly  to  our  data.  Third,  the  low-frequency 
limit  of  the  optical  dielectric  function  can  be  related  to 
previous  measurements''  and  calculations"  of  the  dc 
electrical  resistivity  of  liquid  carbon.  Our  results  show 
much  higher  resistivity  ip -•  6(X) /ill cm)  than  ohmically 
heated  carbon  at  low  pressure''  itr  liquid  silicon  and  ger¬ 
manium.*  but  agree  closely  with  measured  resistivity 
'^p—  1000  /ill cm)  of  carbon  melted  by  ohmic  heating  at 
high  pressure.'*  Additional  new  points  of  the  current  pa¬ 
per  are  time-resolved  reflectance  data  of  femtose-ond- 
laser-melted  diamond,  which  closely  corroborate  the 
graphite  results,  an  analysis  of  the  morphology  of  the 
damage  spot  produced  above  the  critical  melting  flrence 
and  its  relationship  to  ablation,  and  a  quantitative  model 
of  the  delayed  reflectivity  decrease  as  a  surface  hydro- 
dynamic  expansion. 

The  paper  is  organized  as  follows.  Section  II  discussed 
experimental  methods.  Section  III  presents  experimental 
results  of  three  kinds:  (ai  postmortem  analysis  of  sample 
damage  morphology,  (b)  self-reflectiv  ilv  and  -transmis¬ 
sion  of  90-f>  pulses,  and  to  pump-and-probe  reflectivity 
and  transmission  measurements  of  HOPG  and  diamond 
at  several  probe  wavelengths  and  polarizations.  Section 
IV  discus'C'  the  thermodynamic  conditions  which  exist 
in  the  first  picoseconds  after  intense  femtosecond  excita¬ 
tion:  the  rate  of  hydrodynamic  surface  expansion,  the  es¬ 
timated  extent  of  electronic  and  structural  equilibration 
in  the  melt  prior  to  surface  expansion,  and  the  estimated 
magnitude  and  time  evolution  of  the  transient  tempera¬ 


ture  and  pressure.  Finally  in  Sec.  V,  we  analyze  the  opti¬ 
cal  properties  of  liquid  carbon  by  applying  a  Drude  mod¬ 
el  to  femtosecond  reflectance  measurements  at  Ar  —  I  ps, 
after  excitation,  and  relate  this  analysis  to  other  measure¬ 
ments*  '*  and  theories"  ’  of  the  liquid  state  of  carbon. 

I  t  EXPERIMENTAL  METHODS 

Experiments  were  performed  with  samples  of  highly 
oriented  pyrolytic  graphite  (HOPG)  and  250  pm  thick, 
type  Ila  dOOi  diamond  obtained  from  Dubbledee  Dia¬ 
mond  Corporation.  The  basic  material  properties  of  both 
the  HOPG  and  diamond  are  described  in  detail  else¬ 
where."  Throughout  this  work,  many  of  the  mea¬ 
surements  were  repeated  under  identical  conditions  on  a 
reference  silicon  sample,  which,  because  of  its  structural 
similarity  and  extensive  previous  studies  of  its  fem¬ 
tosecond  melting  dynamics’'  and  equilibrium  liquid 
stale  (see  Ref.  43),  served  both  as  a  calibration  and  an  aid 
in  interpreting  results.  All  measurements  were  made  in 
air.  The  surfaces  of  the  HOPG  samples  (normal  to  the 
graphite  c  axis)  were  prepared  by  using  tape  to  peel  off 
the  first  few  layers  immediately  before  each  set  of  mea¬ 
surements  to  expose  a  fresh  surface.  The  diamond  sur¬ 
faces  were  prepared  by  standard  optical  polishing  tech¬ 
niques.  Laser  pulses  of  90  fs  duration,  centered  at  620 
nm,  and  up  to  0.2  mJ  energy  were  provided  by  a  colliding 
pulse  mode-locked  (CPM)  ring  dye  laser’  ’  followed  by  a 
four  stage  Nd:YAG  pumped  optical  dye  amplifier  sys¬ 
tem'"' operating  at  a  lO-Hz  repetition  rate. 

When  freshly  prepared  surfaces  were  excited  by  a  90-fs, 
620-nm  pulse  above  a  critical  fluence  F,„,  a  damage  spot 
appeared.  The  fluence  of  incident  pump  pulses  was  cali¬ 
brated  by  measuring  their  energy  with  a  calibrated  photo¬ 
diode,  while  measuring  the  focused  spot  size  at  the  sam¬ 
ple  surface  by  monitoring  transmission  through  a  pinhole 
(mounted  in  the  same  plane  as  the  sample)  translated 
across  the  focal  spot.  At  each  of  several  fluences  thus 
calibrated,  a  sampling  of  irradiated  spots  was  examined 
under  Nomarski  and  scanning  electron  microscopes,  and 
was  defined  as  the  minimum  fluence  at  which  any 
change  in  surface  morphology  attributable  to  the  laser 
pulse  (e.g.,  amorphization,  cratering)  was  observed." 
Measured  values  of  F,„  are  reported  and  discussed  in  the 
next  section.  F,„  of  a  reference  silicon  wafer,  measured 
by  the  same  procedure,  agreed  with  the  value  0.1  J/cm’ 
reported  by  other  investigators  for  90-fs,  620-nm 
pulses." 

At  fluences  f  >F,„,  the  diameter  and  depth  of  the 
craters  created  by  irradiation  pulses  was  measured  as  a 
function  of  fluence  in  HOPG,  diamond,  and  the  reference 
silicon  sample.  The  average  radius  of  the  damage  spot 
r,|„„  at  each  fluence  for  a  common  focal  spot  size  was 
determined  by  examining  a  statistical  sampling  of  dam¬ 
age  spots  under  a  Nomarski  microscope.  Damage  crater 
profiles  were  mapped  with  approximately  30-A  resolution 
using  an  Alpha-Step  depth  profiler.  These  depth  profiles 
help  in  estimating  the  amount  of  material  removed  from 
the  surface,  and  thus  in  evaluating  the  role  of  ablation  in 
the  experiments. 

Two  types  of  surface  reflectance  measurements  were 
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performed;  (1)  self-reflectance  of  single  90-fs,  620-nm, 
near-normally  incident  pulses  as  a  function  of  fluence, 
and  ill  pump-probe  measurements  using  a  90-fs,  620-nm, 
normally  incident  pump  pulse  and  90-fs  probe  pulses  of 
varying  wavelength  (3l0<A.p,oj^<700  nm),  polarization, 
and  incidence  angle.  In  addition,  with  diamond, 
transmission  measurements  were  performed.  Self¬ 
reflectivity  measurements  were  performed  on  HOPG  by 
focusing  pulses  to  50-/im  diameter  at  a  10°  angle  of  in¬ 
cidence  onto  the  sample  surface,  collecting  and  imaging^' 
the  reflected  pulses,  then  detecting  only  the  center  of  the 
pulse  profile,  which  was  selectivity  apertured  in  the  image 
plane.  This  procedure  avoided  spatial  averaging  over  a 
transverse  fluence  distribution.  Thirty  shots  were  taken 
at  each  fluence,  and  a  matched  reference  photodiode 
monitored  the  pulse  energy  incident  on  the  sample  to 
provide  normalization  against  shot-to-shot  energy  fluc¬ 
tuations,  permitting  reliable  measurement  of  reflectance 
changes  smaller  than  1%.  The  sample  was  translated  150 
nm  after  each  laser  shot  to  ensure  that  each  shot  interro¬ 
gated  a  fresh  region  of  the  surface.  A  computer  con¬ 
trolled  the  acquisition  and  shot-to-shot  subtraction  of  the 
signal  and  reference  voltages  and  the  movement  of  the 
sample  raster  stage.  The  onset  of  surface  morphology 
changes  provided  in  situ  calibration  of  absolute  laser 
fluence, 

Pump-and-probe  measurements  monitored  the  tem¬ 
poral  evolution  of  reflectivity  changes  induced  by  surface 
melting.  For  these  experiments,  a  beamsplitter  divided 
the  amplified  pulses  into  pump  and  probe  puKses.  The 
pump  beam  was  .sent  through  a  computer  controlled  opti¬ 
cal  delay  line  and  focused  to  a  spot  diameter  of  60  /rm 
(HOPG)  or  30  ^tm  (diamond)  at  normal  incidence  onto 
the  sample  surface.  The  probe  beam  was  attenuated  to 
well  below  F„.  Time-resolved  reflectivity  and  transmis¬ 
sion  of  diamond  was  probed  only  at  near  normal  in¬ 
cidence  with  620-nm  pulses,  because  of  the  limited 
amount  of  sample  available.  Normal  incidence 
reflectivity  of  HOPG  was  probed  not  only  at  620  nm.  but 
at  different  wavelengths  generated  either  by  frequency 
doubling  in  a  thin,  phase  matched  potassium  dihydrogen 
phosphate  (KDP)  crystal  (to  produce  310-nm  pulses)  or 
by  white  light  continuum’*  in  a  jet  of  ethylene  glycol  (to 
generate  pulses  at  7(X)  <k<  500  nm).  Band-pass  filters 
selected  the  appropriate  wavelength  for  detection.  In  ad¬ 
dition  reflectance  was  probed  at  an  oblique  incidence  an¬ 
gle  (0  =  50°)  with  s  and  p  polarizations.  Steeper  incidence 
angles,  though  valuable  for  characterizing  the  dielectric 
function  precisely,’’  were  avoided  because  of  the  much 
greater  difficulty  of  fully  overlapping  the  pump  and  probe 
spots  on  the  sample,  and  thus  in  obtaining  quantitatively 
accurate  absolute  reflectance  values.  The  probe  beam 
was  focused  to  a  spot  diameter  of  approximately  30  /rm 
(HOPG)  or  15  /irm  (diamond)  in  the  center  of  the  pump 
spot.  Acquisition  and  shot-to-shot  normalization  were 
accomplished  as  described  above.  As  an  additional  nor¬ 
malization,  each  spot  was  probed  twice;  first  without, 
then  with  the  pump,  accomplished  by  blocking,  then 
transmitting  alternate  pump  pulses  with  an  electronically 
controlled  shutter.  The  difference  between  these  two  sig¬ 
nals,  each  separately  normalized  to  the  reference  photo¬ 


diode  signal,  measured  the  change  in  reflectivity  (HOPG, 
diamond)  and  transmission  (diamond).  Multiple  laser 
shots  at  each  time  delay  were  then  averaged.  This  pro¬ 
cedure  corrected  for  possible  surface  nonuniformities  as 
the  sample  was  rastered,  and  significantly  improved 
signal-to-noise  ratio,  reproducibility,  and  accuracy  of  the 
measured  absolute  values  of  surface  reflectivity.  With  the 
reference  silicon  sample,  this  procedure  yielded  absolute 
reflectance  within  5%  of  independently  measured  values 
(see  Ref  43).’'””  Such  measurement  accuracy  is  criti¬ 
cal  to  extraction  of  accurate  Drude  parameters,  and  thus 
to  conclusions  regarding  the  nature  of  the  liquid  state  of 
carbon.  The  temporal  zero-delay  (  Iq  )  between  pump  and 
probe  was  calibrated  to  within  ±15  fs  by  monitoring  the 
well-characterized  reflectivity  response””  ’®  of  the  melt¬ 
ing  transition  of  the  reference  silicon  sample. 

III.  EXPERIMENTAL  RESULTS  AND  ANALYSIS 

A.  Postmortem  analysis  of  sample  damage 

Using  the  procedure  described  above,  we  find  the  criti¬ 
cal  damage  fluence  of  HOPG  to  be  0. 13+0.02  J/cm’ 
for  single  620-nm,  90-fs  optical  pulses,  very  close  to  the 
value  of  0.14  J/cm’  determined  by  Malvezzi  et  al.^  using 
20-ps,  532-nm  pulses.  From  known  equilibrium  material 
properties  of  ’’OPG,’  ’**  we  can  calculate  the  final,  equili¬ 
brated  surface  temperature  Tf  reached  slightly  below  F„ 
(independent  of  the  dynamical  pathway  by  which  it  is 
reached)  by  equating  absorbed  optical  energy  to  the  in¬ 
tegrated  heat  capacity: 

rO 

i\-R)F„a=p  I  CJDdT  .  (1) 

We  assume  temperature-independent  density  p  =  2. 25 
g/cm’  and  linear  absorption  (a  '  =  300  A),  which  is 
nearly  the  same  for  620  and  532  nm.’'’  A  functional  form 
of  temperature-dependent  heat  capacity  C^IT)  below  the 
melting  temperature  is  given  by  Eq.  (9)  of  Ref  5,  and 
pump  pulse  reflectivity  R  ~0. 3  at  F„  has  been  measured 
directly  (see  next  section).  The  time  constant  for  vertical 
heat  diffusion  out  of  the  absorption  volume  is  approxi¬ 
mately  500  ps,”'”’  and  can  be  neglected  on  the  time  scale 
( ~  1  ps)”  on  which  an  equilibrium  temperature  is  estab¬ 
lished.  We  obtain  T^~6000  K  for  both  20-ps  and  90-fs 
pulses,  20-50  91^  higher  than  the  high  pressure  melting 
temperature,’  of  HOPG.  However,  the  actual  tempera¬ 
ture  is  somewhat  lower  because  the  interband  absorption 
saturates”  as  fluence  approaches  F„.  reducing  the  value 
of  a  by  30-50%  in  Eq.  (1).  This  simple  equilibrium 
analysis  shows  t.ie  HOPG  indeed  reaches  the  melting 
temperature  at  F„, ,  and  suggests  that  melting  occurs  and 
causes  the  damage  spot  to  form  at  F  >  F„ . 

The  damage  threshold  for  diamond  was  measured  to  be 
0.63  *  0.15  J/cm’  for  620-nm,  90-fs  pulses.  The  fivefold 
higher  value  can  be  attributed  to  the  requirement  for 
simultaneous  absorption  of  at  least  three  2.0-eV  photons 
to  cross  the  indirect  energy  gap  of  5.48  eV  in  dia¬ 
mond.’  Self-reflectivity  and  transmission  measurements 
presented  below  indeed  confirm  that  strongly  nonlinear 
absorption  is  present  at  fluences  just  below  F„,  although 
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the  final  surface  temperature  Tf  therefore  difficult  to 
CO.  apute  accurately  because  a  depends  strongly  on  the 
pulse  energy  and  intensity  profile. 

Figure  1  presents  measurements  of  the  average  damage 
spot  radius  as  a  function  of  the  fluence  of  a  normally  in* 
cident  90-fs  pump  pulse  focused  to  a  spot  radius  of  30  fxm 
on  HOPG,  or  15  ^m  on  diamond.  At  F„,  a  small  dam¬ 
age  spot  (only  a  few  microns  in  radius)  appears  in  the 
center  of  the  irradiated  area.  For  F>F„,  the  damage  ra¬ 
dius  increases,  at  first  rapidly,  then  more  slowly  with  in¬ 
creasing  fluence.  For  thermal  melting  by  a  pulse  with 
Gaussian  spatial  profile  f  ^F^expl  the  damage 

spot  radius  should  increase  with  fluence  as^^ 

which  is  the  fraction  of  the  pulse 
cross-sectional  profile  in  which  F  >  F„.  The  curves 
through  the  data  points  in  Fig.  1  are  fits  of  this  formula 
to  the  measured  damage  radii  using  Po~30  pm  (HOPG) 
or  15  pm  (diamond),  which  agree  with  the  independently 
measured  pump  spot  radius.  From  Fig.  1  we  can  deter¬ 
mine  that  for  f  >0.4  J/cm^  (HOPG)  or  1.5  J/cm^  (dia¬ 
mond),  the  damage  spot  encloses  >  95%  of  the  energy  of 
a  Gaussian  profile  probe  pulse  focused  as  described 
above.  However,  below  this  fluence,  any  fluence  depen¬ 
dence  to  the  probe  reflectivity  must  be  attributed  partly 
to  the  changing  size  of  the  melted  region. 

Figures  2(ai  and  2(b)  present  measured  depth  profiles 
across  the  center  of  representative  damage  spots  induced 
by  90-fs,  620-nm  pulses  in  HOPG  and  silicon,  respective¬ 
ly.  Profiles  of  regions  irradiated  below  F„  show  no  evi¬ 
dence  of  craters  in  either  material.  At  2.0  F„  (0.26 
J/cm^)  in  HOPG,  a  typical  crater  with  a  depth  of  ~300 
A  is  shown.  In  silicon,  although  amorphization,  a  signa¬ 
ture  of  prior  melting  and  rapid  recrystallization,  is  clearly 
seen  under  a  microscope  in  the  irradiated  region,^^  the 
depth  profile  still  shows  insignificant  cratering  at  2.0  F„ 
(0,2  J/cm‘i.  At  4.0  F„,  a  typically  deeper  and  wider 
crater  is  shown  for  HOPG,  whereas  the  crater  shown  for 
silicon  at  5,0  F^  is  still  small  by  comparison.  The  plot  of 
average  crater  depth  versus  fluence  (normalized  to  F„  )  in 
Fig.  2(c)  summarizes  measurements  of  numerous  damage 
spots  in  HOPG  (filled  squares)  and  silicon  (open  squares). 
For  HOPG,  measurable  craters  are  observed  at  all 
fiuences  above  the  damage  threshold,  indicating  that  ma¬ 


terial  has  been  lifted  off  the  surface.  The  depths  increase 
monotonically  with  increasing  fluence,  approaching  1000 
A  at  12  F„.  In  silicon,  on  the  other  hand,  measurable 
craters  arc  observed  only  for  F>2. 5F„ ,  and  increase  in 
depth  up  to  about  6F„ .  At  fiuences  above  about  10F„  in 
silicon,  surface  mounds  appear,  which  may  be  caused  by 
freezing  of  the  molten  silicon  after  being  subjected  to  hy¬ 
drodynamic  shock  from  the  laser  pulse. The  higher 
threshold  for  crater  formation  in  silicon  can  be  attributed 
quantitatively^'  to  the  additional  energy  required  to  heat 
the  stable  liquid  to  the  vaporization  temperature,  and  to 
provide  the  latent  heat  of  vaporization.  Thus  for 
F>2.5F„,  the  liquid  is  above  the  vaporization  tempera¬ 
ture.  Most  carbon  phase  diagrams,^  on  the  other  hand, 
show  that  the  liquid  phase  exists  in  equilibrium  only  at 
pressures  above  100  atm,  and  is  thus  thermodynamically 
unstable  at  atmospheric  pressure,  where  it  tends  to  sub¬ 
lime.  Consequently,  material  loss  to  vaporization  is  ex¬ 
pected  as  soon  as  the  liquid  phase  is  formed  at  F„.  With 
femtosecond  excitation  at  F>  F,„,  material  loss  is  prob¬ 
ably  enhanced  compared  to  longer  pulsed  excitation,  be¬ 
cause  of  the  initially  elevated  pressure  (see  Sec.  IV). 
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FIG.  1.  Average  radius  of  damaged  area  as  a  function  of 
pump  fluence  for  HOPG  (solid  squares)  and  diamond  iopen 
squares)  after  photoe.xcitation  by  90-fs,  620-nm  laser  pulses  fo¬ 
cused  to  1/e  spot  radius  of  .10  /am  (HOPG)  or  7.5  /tm  (diamond). 


FIG.  2.  Depth  profiles  of  representative  damage  craters  on 
(a)  HOPG  and  (b)  silicon  for  two  fiuences  F  above  the  critical 
melting  fluence  F„,.  (c)  Average  crater  depth  as  a  function  of 
F/F^  for  HOPG  (solid  squares)  and  silicon  (open  squares). 
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B.  Self-reflectivity  and  transmission 
L  Graphite 

Figure  3(a)  shows  the  near-normal  incidence  self¬ 
reflectivity  of  the  central  part  of  the  profile  of  620-nm, 
90-fs  pulses  from  HOPG  as  a  function  of  fluence  normal¬ 
ized  to  F„.  Below  the  measured  self-reflectivity  is 
fluence  independent  and  indistinguishable  from  the  value 
of  0.3  measured  for  une.xcited  HOPG.  Precisely  at  F„, 
however,  self-reflectivity  begins  to  rise,  reaching  a  values 
above  0.5  at  20F„.  This  result  contrasts  with  the  self¬ 
reflectivity  response  observed  by  Malvezzi  et  a/.*  using 
532-nm,  20-ps  pulses,  where  self-reflectivity  decreases 
above  F„ . 

2.  Diamond 

Figure  3(b)  shows  the  self-reflectivity  ( R )  and  self¬ 
transmission  { T)  of  near-normally  incident  620-nm,  90-fs 
pulses  in  diamond  as  a  function  of  normalized  fluence.  In 
this  case  the  entire  reflected  and  transmitted  pulse  ener¬ 
gies  were  monitored,  and  the  plotted  R  and  T  values  in¬ 
clude  the  fraction  of  pulse  energy  which  was  multiply 
reflected  between  the  faces  of  the  250-^m-thick  sample. 
At  very  low  fluences  iF<0.01f,„),  R  +  T  =  1,  signifying 
the  absence  of  absorption.  In  the  range  0.01F„  <F  <F„, 
R  remains  nearly  fluence  independent,  while  T  drops 
sharply  to  less  than  t  of  its  low  fluence  value,  a  clear  sig¬ 
nature  of  multiphoton  absorption.  A  fit  of  the  data  to  a 
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FIG.  .T  (a)  Self-reflecli\ity  of  90-fs,  620-nm  pulses  normally 
Incident  on  HOPG  as  a  function  of  fluence  F  normalized  to  the 
critical  melting  fluence  F„.  (bl  Fluence  dependent  self¬ 
reflectivity  and  transmission  of  90-fs,  62C-nm  pulses  normally 
incident  on  diamond 


three-photon  absorption  model"  yields  a  three-photon 
absorption  coefficient  7~25  cmVriF\  although  the 
data  is  not  sufficiently  accurate  to  rule  out  the  simultane¬ 
ous  presence  of  higher  order  absorption  nonlineraities. 
Precisely  at  F„ ,  self-reflectivity  begins  to  increase  sharp¬ 
ly,  just  as  observed  in  HOPG,  while  self-transmission 
continues  to  decrease. 

The  precise  correlation  between  the  self-reflectivity  in¬ 
crease  and  the  damage  threshold  in  both  samples  suggests 
that  the  increase  is  caused  by  the  initial  dynamics  of  the 
melting  process  which  occur  during  the  pump  pulse,  as 
discussed  further  in  Sec.  IV.  Measurements  of  the  related 
process  of  fwo-photon  absorption  of  4.0-eV  femtosecond 
pulses  in  diamond  have  been  presented  elsewhere. 

C.  Femtosecond  pump-probe  measurements 
I.  Graphite 

a.  620-nm  wavelength  probe,  the  first  25  ps.  Figure  4 
presents  the  time-resolved  reflectivity  of  620-nm  probe 
pulses  from  HOPG  for  the  first  25  ps  following  excitation 
above  F,„  as  the  pump  fluence  increases  from  0.5  J/cm‘ 
to  20  J/cm‘.  This  time  interval  is  long  enough  to  show 
all  of  the  major  features  of  the  reflectivity  response  with 
relatively  coarse  time  resolution.  Beyond  At  =25  ps,  the 
visible  reflectivity  remains  nearly  constant  for  approxi¬ 
mately  a  nanosecond,  as  also  observed  by  Malvezzi 
el  alfi 

Figure  4(a)  shows  three  reflectivity  responses  corre¬ 
sponding  to  a  common  pump  fluence  {F=0.5  J/cm'l, 
but  to  three  different  probe  polarizations:  s  polarized  at 
0=50°,  near  normal  incidence,  and  p  polarized  at  0  =  50°. 
The  measured  reflectivities  for  Af  <0  agree  with  Fresnel 
equation  predictions  when  the  published  optical  con¬ 
stants^'’  n  =2.6,  A-  =  1.5  for  unexcited  graphite  at  620  nm 
are  used.  For  Af>0,  the  inequality  R,(0=5O°) 
>Rp(0=  1O°)>/J^(0=5O°)  is  maintained  at  all  time  de¬ 
lays.  At  At  =0,  reflectivity  increases  sharply  with  a  pulse 
width  limited  rise  time  to  a  value  roughly  50%  higher 
than  the  initial  value.  There  is  a  slight  “overshoot”  in 
the  reflectivity,  evident  as  only  a  single  data  point  on  this 
coarse  time  scale,  which  recovers  in  a  fraction  of  a  pi¬ 
cosecond.  There  follows  a  slower  decay  to  the  initial 
value  in  ~8  ps.  then  a  continuing  drop  to  a  final  steady 
state  value  in  20  to  40  ps.  From  the  three  measured 
reflectivities,  the  complex  index  of  refraction  n  +ik  can 
be  determined  uniquely  at  any  time  delay  from  Fresnel's 
equations,  provided  that  the  surface  is  assumed  to  remain 
perfectly  discrete.  Thus  at  0.3  ps  <  Ar  <  1  ps,  we  obtain 
n  =1.35,  A  =1.93,  showing  increased  metallic  character. 
At  time  delays  greater  than  several  picoseconds,  howev¬ 
er,  the  assumption  of  a  discrete  interface  may  no  longer 
be  valid,  as  discussed  further  in  Sec.  IV. 

Figures  4(b)  and  4(c)  show  the  reflectivity  response 
probed  at  near  normal  incidence  at  progressively  higher 
pump  fluences  of  2.0  and  3.9  J/cm^  respectively.  Again 
reflectivity  increases  sharply  at  A/  =0,  reaching  a  max¬ 
imum  value  which  increases  slightly  with  pump  fluence. 
The  subsequent  drop  in  reflectivity  depends  strongly  on 
pump  fluence.  The  reflectivity  decays  more  rapidly,  .an.! 
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the  final  reflectivity  value  becomes  smaller,  as  pump 
fluence  increases.  This  trend  continues  at  higher 
fluences,  as  shown  by  the  oblique  incidence  probe 
reflectivity  data  at  20  J/cm*  pump  fluence  in  Fig.  4(di 
(data  for  a  near  normally  incident  probe  is  not  shown  in 
this  figure).  The  general  pattern  of  a  sharp  reflectivity  in¬ 
crease  followed  by  a  slower,  but  fluence-dependeni  de¬ 
crease  closely  resembles  the  reflectivity  response  of  sil¬ 
icon^'  when  melted  by  femtosecond  pulses  above  the 
vaporization  fluence  F  >  2. 5F„, -0. 25  J/cm*.  The  main 
difference  is  that  in  silicon  the  reflectivity  decay  is  not  ob¬ 
served  for  pump  fluences  F„<F<2  5F„  between  the 


FIG.  4.  Femtosecond  time-resolvcd  refieciivity  of  HOPG 
lai  Excitation  at  pump  fluence  F  =0.5  J/cm".  probe  at  620  nm. 
comparing  50°  incideni  anMc  t  and  p  polarized  reflectivity,  and 
near-normal  incidence  reflecuviiy.  (bi  Near-normal  incidence 
620-nm  reflectivity  at  F  -2.0  J/cm":  dashed  line  shows  convo¬ 
lution  of  data  with  20-p  i  pump  and  probe  pulses,  c'  Same  with 
F  '  3  9  J/cm*.  (di  Excitation  at  20  J/cm",  probe  at  620  nm, 
comparing  50°  incident  angle  v  and  p  polarized  rerleelivity. 
Vertical  scales  on  all  panels  denote  absolute  reflectnily. 


melting  and  vaporization  fluences,  whereas  in  HOPG,  the 
decay  is  observed  at  all  fluences  F  >  F^.  Thus  the 
reflectivity  decay  correlates  with  the  appearance  of  dam¬ 
age  craters  described  above,  suggesting  that  hydro- 
dynamic  surface  e.xpansiun  causes  the  reflectivity  decay. 

It  IS  helpful  to  fit  the  observed  reflectivity  responses  m 
Figs.  4(a)-4id)  phenomenologically  to  a  function 
which  convolves  a  single  exponential  response  function 
/■(f)  —  exp(  -  ( f  -  f  ’  i/Tj  ]  C  with  the  temporal  intensity 
envelopes  and  P,„,,|,^!f.i  of  the  pump  and  the 

probe  pulses: 

1  Af  i  f  f  A f ;  /  '  ^  1  f '  ir  ( / '  )df  'df  . 

(2» 

is  the  relaxation  time  constant,  and  the  modified  base¬ 
line  C  represents  the  final  reftcctiviiy  value  The  pump 
and  probe  intensity  envelopes  are  conveniently  represent¬ 
ed  as  sech‘i(  /T„i  Examples  of  fils  to  Flq.  (2*,  using  (he 
measured  pube  width  r.  -- 9{)  fs.  are  shown  by  the  curves 
superuuposed  on  the  data  in  Figs.  4ib!  and  4ici  where 
time  constants  and  4  ps,  respectively,  were  used. 

Fits  of  equal  quality  are  obtained  for  other  fluences.  In 
Fig.  5  these  decay  lime  constants  obtained  in  the  same 
manner  from  femtosecond  reflectivity  data  on  silicon 
phoioexcited  at  F  '•0.25  J/cm"  are  also  plotted  isolid  cir- 
vlesi.  For  similar  fluences,  similar  time  constants  are  ob¬ 
tained  m  HOPG  and  silicon.  The  solid  line  is  a  theoreti¬ 
cal  curve  which  predicts  a  characteristic  time  constant 
for  hydrodynamic  surface  expansion,  which  decreases 
with  laser  fluence  in  a  manner  very  similar  to  This 
model  is  discussed  further  in  Sec.  IV. 

Equatuin  (2)  can  also  be  used  to  calculate  the 
reflectivity  response  RiAi)  which  would  be  observed  in 
an  experiment  with  longer  pump  and  probe  pulses.  The 
grey-tone  curves  in  F'igs.  4(h)  and  4(ci  were  obtained  us¬ 
ing  the  same  response  function  ri  n.  but  =20  ps.  corre¬ 
sponding  to  the  pulse  duration  used  by  Malvezzi  ef  al.^ 
Clearly  the  initial  reflectivity  increase  cannot  be  observed 
at  all  with  this  lime  resolution.  In  addition,  the  fluence 
dependence  of  the  reflectivity  drop  is  largely  masked. 
Consequently  the  calculated  response  shows  only  a 


FIG.  5.  Fluence  dependence  of  the  decay  time  constant  of 
the  reflectivity  response  of  HOPG  lopen  circles)  and  silicon 
(solid  circlcsi  The  solid  line  is  a  iheorelical  fit  to  a  F  ' 
dependence,  as  discussed  in  ibe  text 
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monotonic  reflectivity  drop  following  photoexcitation, 
consistent  with  the  picosecond  data  of  Malvezzi  ei  al.  “ 
One  additional  feature  of  the  Fig.  4  data  is  noteworthy. 
Maximum  s-polarized  reflectivity,  observed  at  A/~l  ps, 
increases  form  0.58  at  0.5  J/cm’  [Fig.  4(a)]  to  0.69  at  20 
J/cm^  [Fig.  4(d)],  a  trend  also  observed  in  normal  in¬ 
cidence  reflectivity.  .Maximum  /j-polarized  reflectivity, 
on  the  other  hand,  decreases  from  0.3  (0.5  J/cm^)  to  0.26 
(20  J/cm^).  The  selective  suppression  of  p-polarized 
reflectivity  at  such  a  high  fluence  as  20  J/cm^  probably 
results  from  a  significant  degree  of  ionization,  and  the 
formation  of  a  short  scale  length  plasma  density  gradient 
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FIG.  6,  The  first  picosecond  of  the  lime-resolved  reflectivity 
response  of  HOPG  probed  by  90-fs,  62()-iim  pulses  at  near  nor¬ 
mal  incidence,  (a)  Excitation  at  pump  fluence  /•'  below  critical 
melting  fluence  F,„;  (bl  F  --  F,„ ;  fci  -'H  increasing  fluences  above 
F„.  Vertical  scales  denote  absolute  reflectivity. 


at  the  surface.  Consequently  resonance  absorption  selec¬ 
tively  attenuates  p-polarized  light.”  Similar  effects  have 
been  observed  and  analyzed  by  Milchberg  et  Since 
x-  and  p-polarized  reflectivity  increase  about  equally  at 
lower  fluences  [e.g..  Fig.  4(a)],  however,  we  can  assume 
that  the  surface  is  discrete  at  early  time  delays. 

b.  620-nm  probe  wavelength,  the  first  picosecond.  Fine 
time-scale  data  over  the  first  picosecond  documents  the 
initial  stage  of  photoexcitation  and  melting  dynamics. 
Figure  6  shows  the  first  picosecond  of  the  reflectivity 
response  probed  at  near-normal  incidence  as  the  pump 
fluence  is  varied  from  0.033  to  3.9  J/cm^.  Just  below  the 
threshold  [F  =0.033  J/cm^,  Fig.  2(a)],  a  small  decrease  in 
reflectivity  is  observed  during  the  pump  pulse,  which  re¬ 
covers  in  ~  2(X)  fs.  This  response  is  caused  by  the  genera¬ 
tion,  relaxation,  and  recombination  of  an  electron-hole 
plasma,  and  is  in  good  quantitative  agreement  with  the 
extensive  reflectivity  measurements  below  F„  reported  by 
Seibert  et  al.~^  When  the  pump  fluence  is  increased  to 
exactly  0.13  J/cm',  a  nearly  flat  response  is  seen,  as 
shown  in  Fig.  2(b).  As  soon  as  the  fluence  is  increased 
above  F„,  a  sharp  reflectivity  increase  is  observed 
[F=0.25  J/cmv  Fig.  2ic)],  followed  by  a  fast  (  ~200  fs) 
recovery  and  a  small  residual  reflectivity  increase,  which 
persists  for  over  10  ps.  For  the  data  in  Figs.  2(b)  and  2(c), 
the  damage  spots  were  smaller  than  the  probe  spot.  Con¬ 
sequently.  the  measured  response  averages  the  actual 
response  below  and  above  F,„  to  some  extent.  At  higher 
fluences,  however,  essentially  the  entire  probe  pulse  sees 
material  excited  above  F„.  At  F  =0.1  J/cm’  and  1.3 
J/cm’  [Figs.  2(d)  and  2(e)],  the  fast  transient  is  still  evi¬ 
dent,  but  the  persistent  reflectivity  component  increases 
sharply  in  magnitude  to  over  0.4.  At  still  higher  pump 
fluence  [F  =  3.9  J/cm".  Fig.  2(f)],  the  fast  transient  is  no 
longer  evident,  and  a  slight  further  increase  to  F  -0.5 
occurs. 

Two  important  features  of  the  initial  reflectivity 
response  above  F,„  are  noteworthy.  First,  the  rise  times 
of  reflectivity  signals  in  Figs.  6(c)-6(e)  are  all  pulse  width 
limited.  This  contrasts  with  a  resolvable  delay  of  ~2(X) 
fs  in  the  rise  time  of  the  reflectivity  of  sili;on  when  pho- 
toexcited  just  above  F,„ . This  comparison  suggests  that 
the  initial  response  of  HOPG  above  F„  is  caused  by  an 
overdense  electron-hole  plasma,  which  rapidly  equili¬ 
brates,  whereas  the  electron-hole  plasma  in  silicon  fails  to 
reach  critical  density  just  above  F,„,  yielding  a  response 
time  governed  by  the  melting  dynamics.  Second,  the  fast 
transient  observed  above  F„, ,  though  opposite  in  sign,  re¬ 
covers  on  the  same  time  scale  as  the  transient  observed 
below  F^  [Fig.  6(a)  and  Ref.  23].  This  comparison  sug¬ 
gests  that  the  former,  like  the  latter,^’  is  also  caused  by 
equilibration  of  hot  electrons  with  the  lattice,  as  dis¬ 
cussed  further  in  Sec.  IV, 

c.  Other  probe  wavelengths.  A  reflectivity  spectrum 
over  as  wide  a  range  of  wavelengths  as  possible  is  key  to 
modeling  the  underlying  electronic  structure  of  liquid 
carbon.  Figure  7  shows  the  time-resolved  reflectivity  of 
3l0-nm,  probe  pulses  from  HOPG  following  excitation 
below  F„,  [Fig.  7(a)]  and  at  two  fluences  above  F„  [Figs. 
7(b)  and  7(c)].  The  response  differs  substantially  from 
that  at  620  nm.  Following  sample  excitation  at  0  1/ 
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[Fig.  7(a)],  ultraviolet  reflectivity  first  decreases  for  the 
first  100  fs,  then  recovers  within  2  ps  to  a  steady-state 
value  4%  higher  that  the  initial  reflectivity,  which  per¬ 
sists  for  ~500  ps.  A  detailed  interpretation  of  this 
response  has  been  presented  elsewhere."'  The  initial  de¬ 
crease  is  caused  by  the  rapid  generation  and  energy  relax¬ 
ation  of  electrons  and  holes,  while  the  slowly  rising 
reflectivity  is  caused  by  the  thermal  renormalization  of 
the  graphite  rr  energy  bands,"'  and  provides  a  measure  of 
the  time  required  for  full  electron-lattice  equilibration. 
The  ultraviolet  reflectivity  response  above  f,„  dilfers 
significantly  from  the  response  when  excited  below  F„,, 
providing  further  evidence  that  an  irreversible  phase 
transition  occurs  at  F,„.  At  F  =  0.7  J/cm%  no  reflectivity 
increase  whatsoever  is  observed  with  the  ultraviolet 
probe,  in  sharp  contrast  to  the  620-nm,  probe  (Figs.  4<a) 
and  6(d)].  Instead,  reflectivity  remains  unchanged  for  the 
first  picosecond,  then  monotonically  decreases  within  20 
ps  to  a  steady-state  value  '  R  =0.05 !,  which  persists  for  at 
least  300  ps.  When  pump  fluence  is  increased,  the 
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reflectivity  drops  more  rapidly,  and  to  a  lower  final  value, 
the  same  trends  observed  with  the  620-nm,  probe.  In  ad¬ 
dition,  when  the  pump  fluence  is  increased  to  4  J/cmF  a 
slight  transient  refleciiviiy  increase  (barely  above  the 
noise  level!  is  observed  during  the  first  -  0. 5  ps.  as  shown 
on  the  magnified  scale  of  Fig.  7(bi.  This  transient  evolves 
on  approximately  the  same  time  scale  as  the  refleciiviiv 
transients  shown  in  Figs,  6(a)~6(ei  and  7ta!.  We  aitribute 
this  to  the  initially  nonequilibrium  electron-hole  plasma 
which  at  this  fluence  has  become  sufficiently  dense  to  be 
detected  at  310  nm. 

Figure  8  shows  fluence  dependent  reflectivity  measure¬ 
ments  made  at  fixed  time  delays  of  0.7  and  20  ps,  using  a 
white  light  continuum  probe,  where  wavelengths  of  550 
nm  [Fig.  8(a)j,  620  nm  (Fig.  8(b)j,  and  7(X)  nm  (Fig.  8(ci] 
were  selected  for  detection.  A  delay  of  0.7  ps  probes  the 
high  reflectivity  portion  of  the  evolution  after  the  initial 
transient  2(X)-fs  features,  while  a  delay  of  20  ps  probes  the 
approach  to  a  low  steady-slate  reflectivity .  In  both  cases, 
no  change  in  refleciiviiy  occurs  until  ihe  critical  melting 
threshold  is  reached.  At  Al  --0.7  ps,  Ihe  refleciiviiy  be¬ 
gins  to  rise  precisely  al  the  critical  melting  fluence  at  all 
probe  wavelengths,  reaching  a  final  value  of  -  1.5/?,,  al 
I0F,„.  In  addition,  the  reflectivity  scales  equally  with 
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fluence  in  each  case.  Spot  size  effects  contribute  to  the 
increasing  magnitude  of  the  reflectivity  in  the  fluence 
range  <F  <  3F„ .  Above  3F„ ,  however,  the  increas¬ 
ing  reflectivity  is  caused  entirely  by  changing  properties 
of  the  material  as  its  temperature  increases.  The  scale  at 
the  top  of  Fig.  8(a)  gives  a  rough  estimate  (±50%)  of 
sample  temperature  at  At  =0.7  ps  derived  by  an  exten¬ 
sion  of  Eq.  (1),  as  discussed  further  in  Sec.  IV.  At  a  given 
fluence  (temperature)  reflectivity  is  slightly  higher  at 
longer  wavelengths.  At  At  =20  ps,  for  all  wavelengths, 
the  reflectivity  drops  below  the  background  HOPG  value 
precisely  at  the  critical  melting  fluence,  and  decreases 
monotonically  with  fluence.  A  very  slight  rise  in 
reflectivity  is  observed  below  the  critical  melting  fluence 
at  620  nm,  as  observed  in  previous  studies^^  below  F„, 
and  is  related  to  electron-hole  carrier  generation. 


2.  Diamond 


Below  F„  very  different  femtosecond  responses  are  ex¬ 
pected  in  graphite  and  diamond,  because  of  the  very 
different  electronic  band  structures.  Above  F„,  however, 
we  expect  to  measure  similar  optical  properties,  if  the 
two  samples  melt  and  equilibrate  to  a  common  liquid 
phase  on  the  time  scale  of  the  measurements. 

Figure  9  shows  the  time-resolved  reflectivity  and 
transmission  of  620-nm  probe  pulses  from  diamond  fol¬ 
lowing  excitation  just  below  F„  fFig.  9(a)l  and  well  above 
F„  [Figs.  9(b)  and  9(c)].  For  these  measurements,  the 
probe  is  incident  at  a  large  enough  angle  (  -  20°)  that  the 
front  surface  reflection  has  been  spatially  separated  from 
the  back  surface  reflection.  Below  F,„  [Fig.  9(a)],  no 
change  in  front  surface  reflectivity  was  detectable.  How¬ 
ever,  transmission  drops  to  0.45  immediately  upon  pho¬ 
toexcitation.  with  negligible  recovery  over  tens  of  pi¬ 
coseconds.  We  believe  this  response  to  be  free  carrier  ab¬ 
sorption  in  an  electron-hole  plasma  generated  by  three- 
photon  absorption.  The  recovery  rate  is  severely  limited 
compared  to  graphite  because  of  the  large  energy  gap 
which  separates  electrons  and  holes  following  their  relax¬ 
ation  to  band  edges.  Above  F„ ,  we  observe  on  a  coarse 
time  scale  [Fig.  9(c)]  a  reflectivity  response  qualitatively 
very  similar  to  the  graphite  responses  shown  in  Figs.  4:  a 
sharp  initial  increase  followed  by  a  slower  decay  over 
— 10  ps.  The  lower  peak  reflectivity  value  (  —0.3)  com¬ 
pared  to  HOPG  excited  at  10F,„(  —0.45)  is  partly  an  ar¬ 
tifact  resulting  from  the  coarse  time  spacing  of  data 
points  used  to  conserve  sample  area  in  taking  the  Fig. 
9(c)  data.  When  repeated  at  the  same  fluence  (F=9F„, ) 
with  finer  time  resolution  over  a  shorter  time  interval 
[Fig.  9(c)],  the  data  indeed  show  that  diamond  reflectivity 
increases  from  0.17  to  approximately  0.4,  very  close  to, 
although  still  slightly  lower  than,  the  value  observed  with 
HOPG.  This  small  discrepancy  may  be  a  real  difference 
in  the  reflectivity  of  the  liquid  phases,  perhaps  because 
the  liquid  initially  retains  the  higher  density  of  diamond 
(3.5  g/cm’)  compared  to  graphite  (2.2  g/cm  '). 

The  corresponding  transmission  response  of  diamond 
excited  at  }F^  (solid  squares)  and  6F,„  (open  squares)  is 


shown  in  Fig.  9(b;  Transmission  decreases  abruptly  at 
^it  =0  from  Its  unexcited  value  (0.7)  to  less  than  0.2.  On 
the  time  scale  (25  ps)  on  which  reflectivity  undergoes 
large  changes,  no  further  change  in  transmission  is  ob¬ 
served.  The  slightly  higher  final  transmission  value  at 
2F„  results  from  the  probe  spot  being  slightly  larger  than 
the  damage  spot.  A  similar  transmission  response  is  ob¬ 
served  with  silicon-on-sapphire  upon  melting  with  a  fem¬ 
tosecond  pulse. 
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FIG.  9.  Femtosecond  time-resolved  from  surface  reflectivity 
(  R )  and  transmission  (  T)  of  diamond,  probed  at  near  normal  in¬ 
cidence  by  90-fs,  620  nm  pulses,  (a)  Pump  fluence  F  =  n,6F„; 
(b)  T  at  F  (solid  squares)  and  6F,„  (open  squares);  (c)  R  at 

F  -'9F,^  on  coarse  time  scale;  id)  detail  of  first  picosecond 
rcfleclivity  response  at  F  -  ‘)F„ . 
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IV.  DISCUSSION 
A.  Hydrodynamic  surface  expansion 

In  order  to  obtain  meaningful  optical  data  on  the  liquid 
phase,  it  is  critical  to  collect  data  before  surface  optical 
properties  deteriorate  because  of  hydrodynamic  expan¬ 
sion.  Femtosecond  experiments  remove  ail  ambiguity 
that  the  surface  can  be  probed  before  the  surface  expands 
significantly.  The  consistent  observation  of  an  early 
reflectivity  increase  (unresolvable  in  ps  experiments®)  fol¬ 
lowed  by  a  fluence  dependent  decrease  is  therefore  simply 
and  naturally  explained  as  initial  creation  of  a  highly 
reflective  condensed  phase,  which  is  subsequently  ob¬ 
scured  by  surface  expansion.  Indeed  there  is  general 
agreement*  that  at  fluences  F>  5F„,  ablation  attenu¬ 
ates  surface  reflectivity  within  several  picoseconds  of 
photoexcitation. 

Nevertheless,  some  authors*'  ”  have  argued  that  during 
a  20-pb  melting  pulse  in  the  limited  fluence  range 
F,^<F<5F„,  insufficient  material  can  evaporate  to 
affect  surface  reflectivity  significantly,  and  that  reduced 
reflectivity  in  this  fluence  range  must  therefore  be  as¬ 
cribed  to  intrinsic  (electrically  insulating)  properties  of 
the  condensed  surface.  Analogously,  in  our  previous  pa¬ 
per.''*  we  did  not  rule  out  the  possibility  that  the  delayed 
reflectivity  decrease  for  F„  <  F  <  5F„,  observed  in  fs  ex¬ 
periments  could  be  caused  by  a  delayed  metal-io- 
insulator  transition  within  the  liquid,  which  maintained  a 
discrete,  unobscured  interface. 

Several  cogent  arguments  can  now  be  advanced  against 
this  interpretation  of  the  fs  experiments.  First,  if  the 
reflectivity  decrease  corresponded  to  an  unobscured  insu¬ 
lating  material  developing,  a  transmission  recovery  on 
the  same  time  scale  as  the  reflectivity  decrease  would  be 
e.xpected.  Yet  no  such  recovery  is  observed  in  the  dia¬ 
mond  transmission  measurements  at  any  pump  fluence 
[Fig.  9(b)].  Instead  transmission  drops  sharply  upon  pho¬ 
toexcitation  to  a  value  which  remains  constant  for  over 
25  ps.  On  the  other  hand,  this  pattern  is  consistent  with 
the  early  formation  of  a  glossy,  reflective  liquid  and  sub¬ 
sequent  surface  expansion.  Such  transmission  measure¬ 
ments  were  not  available  in  the  previous  report  on  graph¬ 
ite."*  Second,  the  careful  damage  morphology  analysis 
now  shows  that  delayed  reflectivity  decreases  following  fs 
melting  in  both  graphite  and  silicon  are  precisely  correlat¬ 
ed  with  appearance  of  a  damage  crater,  and  thus  with  re¬ 
moval  of  material  from  the  surface  at  some  time  after  ir¬ 
radiation.  Specifically,  in  silicon,  damage  craters  and  de¬ 
layed  reflectivity  decreases* ‘  both  are  observed  only  for 
F>2.5F„,  the  fluence  required  to  heat  the  liquid  to  the 
boiling  temperature  and  provide  the  latent  heat  of  vapori¬ 
zation;  corre.spondingly  in  graphite,  damage  craters  and  a 
delayed  reflectivity  increase  both  occur  for  F>F,„,  sug¬ 
gesting  that  liquid  carbon  spontaneously  ablates  at  at¬ 
mospheric  pressure.  Third,  the  time  constants  of  the 
reflectivity  decrease  (Fig.  5),  have  nearly  the  same  values 
and  fluence  dependence  for  femtosecond-irradiated  car¬ 
bon  and  silicon,  suggesting  a  common  ablative  mecha¬ 
nism.  Finally,  no  theoretical  basis  exists  for  structural  or 
electronic  equilibration  within  the  liquid  on  a  10-30-ps 


time  scale.  On  the  contrary,  recent  molecular  dynamics 
simulations  of  the  liquid  state  of  carbon*  show  that  the 
liquid  reaches  steady  state  structural  and  electronic  prop¬ 
erties  within  ~  1  ps  of  heating  to  the  melting  tempera¬ 
ture. 

In  order  to  reconcile  the  observation  of  a  strong 
reflectivity  decrease  within  -- 10  ps  with  the  very  small 
evaporation  rate  expected*  ”  at  F  <5F^,  we  must  recon¬ 
sider  the  microscopic  mechanism  for  the  reflectivity  de¬ 
crease.  The  "evaporation”  model  described  in  Refs.  8 
and  37  in  the  context  of  ps  irradiation  depicts  the  escape 
of  individual  atoms  from  the  liquid  to  form  an  absorptive 
vapor  above  the  liquid  surface,  which  is  assumed  to  re¬ 
tain  an  atomically  sharp  interface.  Indeed  optical  ab¬ 
sorption  from  the  expected  vapor  density  at  Al  -  10  ps  is 
clearly  inadequate  to  account  for  the  observed  reflectivity 
decrease  at  F  <  SF„ .  On  the  other  hand,  recent  studies  of 
metal  surfaces  irradiated  by  intense,  femtosecond 
pulses®*  **  show  that  strong  reflectivity  decreases  within  a 
few  picoseconds  are  caused  by  hydrodynamic  expansion 
of  the  surface  as  a  whole,  because  of  the  high  transient 
pressure  created  by  femtosecond  irradiation  (Sec.  IlIC 
below).  The  initially  atomically  abrupt  interface  develops 
a  short  scale  length  density  gradient  shortly  following  ex¬ 
citation,  similar  to  the  hydrodynamic  expansion  of  a  sur¬ 
face  experiencing  a  strong  shock  wave.’*  When  the  gra¬ 
dient  scale  length  L  approaches  a  significant  fraction  of 
an  optical  wavelength  A.,  surface  reflectivity  can  no  longer 
be  described  by  Fresnel's  equations,  which  are  based  on 
boundary  conditions  for  an  abrupt  interface,  but  instead 
requires  numerical  treatment  via  the  Helmholtz  wave 
equation.’*  Typical  calculations’*  '*  show  strong 
reflectivity  decreases  for  £>0.23,,  with  complicated 
dependence  on  polarization,  wavelength,  material,  and 
the  precise  form  of  the  density  gradient. 

In  order  to  estimate  the  time  r  for  such  a  density  gra¬ 
dient  to  form  under  our  experimental  conditions,  we  use 
the  expression  for  hydrodynamic  expansion  velocity  of  a 
surface  experiencing  a  strong  shock  wave;’* 


where  <  u  )  is  the  average  particle  velocity  in  the  normal 
direction,  y  =  Cp/C,  is  the  ratio  of  electronic  heat  capa¬ 
cities  at  constant  pressure  and  volume,  Z  is  the  effective 
ion  charge,  T  is  the  surface  (electron)  temperature,  and  M 
is  the  mass  of  a  carbon  atom.  Estimating  y~1.6  and 
Z--4  for  carbon  at  high  temperatures,-''  we  obtain 
(y  ) -'2[<;flr(eV)]'^®X  10*  cm/s,  close  to  the  speed  of 
sound.  To  a  fair  approximation,  k^T ~0.5  eV  at  F  =F,„, 
and  kgT «  Fat  higher  fluences.  Thus  at  F  =4F,„,  for  ex¬ 
ample,  a  scale  length  £=0.5A  =  310  nm  develops  in  a 
time  t~L/{  c  )  ~- 10  ps,  consistent  with  the  time  scale  of 
the  reflectivity  decrease  in  Fig.  4(a).  To  obtain  a  fit  to  the 
fluence  dependence  of  the  exponential  decay  constants  r 
plotted  in  Fig.  5,  we  assume  that  surface  reflectivity  is  re¬ 
duced  by  e  '  when  an  £  develops  over  a  fraction 
0. 1  <  /  <  1  of  the  probe  wavelength  A.,  where  /  can  be 
treated  as  a  fitting  parameter  which  depends  on  polariza¬ 
tion.  wavelength,  material,  and  the  form  of  the  densiiv 
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gradient.  The  time  for  the  necessary  gradient  to  form  is 
then  T=fk/(v).  As  the  fluence  (and  therefore  T)  in¬ 
creases,  {v}  increases  as  ^  resulting  in  a  reduced 

gradient  formation  time.  The  solid  (graphite)  and  dashed 
(siUcon)  theoretical  curves  in  Fig.  5  are  fits  to  the  data  ob¬ 
tained  using  /=0.6  (graphite)  or  0.15  (^silicon),  corre¬ 
sponding  to  Z,  =  3720  A  (graphite)  or  930  A  (silicon). 

Thus  both  the  magnitude  and  fluence  dependence  of 
the  reflectivity  decay  time  constant  r  in  both  silicon  and 
graphite  (Fig.  5)  are  consistent  with  the  one-dimensional 
expansion  rate  of  condensed  material  surface  into  a  vacu¬ 
um  after  being  pressurized  and  heated  by  the  absorption 
of  a  femtosecond  pump  pulse.  Significantly,  the  fluence 
dependence  of  r  for  both  F„  <F<  5F„  and  F  >  5F„  fit 
naturally  to  a  single  model,  further  confirming  that  the 
delayed  reflectivity  decreases  in  both  fluence  ranges  of  fs 
excitation  are  caused  by  a  common  mechanism  of  surface 
hydrodynamic  expansion.  Thus  if  the  influence  of  hydro- 
dynamic  expansion  is  to  be  ruled  out  unambiguously,  the 
optical  properties  of  a  condensed  phase  driven  to  elevated 
temperature  and  pressure  by  a  femtosecond  pulse  must  be 
probed  within  a  critical  time  window  of  no  more  than  a 
few  picoseconds. 

B.  Electron-lattice  and  structural  equilibration 

A  key  question  then  becomes:  to  what  extent  has  the 
photoexcited  carbon  equilibrated  within  the  available 
time  window?  Several  lines  of  evidence  suggest  that  elec¬ 
tron  and  lattice  temperatures  substantially  equilibrate 
within  ~  1  ps  following  photoexcitation  of  the  carbon 
sample.  First,  femtosecond  spectroscopy  of  graphite  ex¬ 
cited  below  F„  shows’’  that  photoexcited  carriers  relax 
almost  completely  within  1  ps.  The  graphite  lattice  tem¬ 
perature  is  measured  directly  by  the  rising  UV  reflectivity 
response  in  Fig.  7(a),”  which  confirms  that,  below  F,„, 
the  lattice  reaches  its  maximum  temperature  within  2  ps, 
and  has  nearly  reached  the  maximum  within  1  ps.  Simi¬ 
lar  or  shorter  electron-lattice  equilibration  times  have 
been  measured  for  many  metals.'**’  As  the  lattice  melts, 
the  increased  disorder  should  further  accelerate 
electron-lattice  equilibration,  as  observed,  for  example,  in 
comparisons  of  carrier  lifetimes  in  crystalline,'*’  amor¬ 
phous,'”  and  liquid”  silicon. 

Second,  the  subpicosecond  reflectivity  transient  ob¬ 
served  upon  photoexciting  graphite  above  F,„  (Figs. 
6(b)-6(e)  and  7(c)]  recovers  on  a  similar  time  scale  to  the 
transient  below  F,„.  In  our  previous  paper,”’  we  showed 
that  this  reflectivity  “overshoot"  at  620-nm,  probe  wave¬ 
length  [Figs.  6(b)-6le)]  is  explained  by  the  instantaneous 
generation  of  an  overdense  electron-hole  plasma 
( A'j.j,  >  lO”  cm  ’ )  upon  photoexciting  HOPG  at  F  ^  F,„ . 
With  an  ultraviolet  probe  [Figs.  7(b)  and  7(c)],  higher 
pump  fluence  (  ~  10F,„ )  and  carrier  density 
( >  4  X  lO”  cm  ’ )  is  required  to  see  this  overshoot,  as 
expected  from  the  Drude  dielectric  function,'”  and 
confirms  the  interpretation  of  the  overshoot  as  a  short¬ 
lived  electron  hole  plasma.  The  common  subpicosecond 
recovery  time  of  these  reflectivity  transients  both  below 
and  above  F,„  supports  the  interpretation  that  electron 
and  lattice  subsystems  equilibrate  to  nearly  a  common 


temperature  within  1  ps. 

Third,  molecular  dynamics  iMD)  simulation  of  the 
liquid  state  of  carbon  by  Galli  et  al.*'  show  that  structur¬ 
al  equilibration  is  also  very  rapid:  the  structural  and 
electronic  properties  of  an  initial  amorphous  carbon 
structure  heated  to  5000  K  in  the  electronic  ground  state 
reach  a  steady-state  liquid  structure  in  well  under  1  ps. 
Simulations  which  begin  by  heating  a  crystalline  graphite 
or  diamond  structure  might  yield  a  somewhat  longer 
equilibration  time,  and  would  be  of  great  interest  in  inter¬ 
preting  femtosecond  experiments.  Nevertheless  our  ob¬ 
servation  of  very  similar  subpicosecond  reflectivity  evolu¬ 
tion  of  graphite  and  diamond  samples  excited  at  F  >  F„ 
[Figs.  6  and  9(c)]  strongly  suggests  that  both  have  equili¬ 
brated  to  a  common  liquid  phase  at  Af  ~  1  ps. 

Finally,  in  a  well-studied  material  as  silicon,  subpi¬ 
cosecond  reflectivity  transients  are  also  observed,’®  but 
reach  steady-state  values  within  1  ps””  which  are  fully 
consistent  with  the  independently  measured  optical  prop¬ 
erties  of  fully  equilibrated,  over-melted  silicon.””  In 
fact,  femtosecond  and  picosecond  experiments  at 
F„,<F<2.5F„,  show  a  constant  reflectivity  for  0.5 
ps<At  <  1  ns  or  more.”  ”  Femtosecond  surface  second 
harmonic  measurements”  show  that  long-range  crystal¬ 
line  order  is  lost  in  0.5  ps.  demonstrating  that  the  mea¬ 
sured  reflectivity  is  indeed  that  of  a  liquid  phase. 

Available  evidence  thus  supports  the  conclusion  that 
optical  properties  probed  at  A/  ~  1  ps  are  those  of  a  liquid 
state  of  carbon  with  substantially  equilibrated  electron 
and  lattice  subsystems.  At  very  high  fluences  F»F„, 
the  term  “solid  density  plasma"””’  may  be  more  ap¬ 
propriate  than  “liquid,"  in  recognition  of  the  higher  ion¬ 
ization  states*'*  which  accompany  higher  temperature  and 
pressure  conditions.  However,  there  is  no  sharp  demar¬ 
cation  between  the  two  regimes,  and  both  may  be  encom¬ 
passed  by  the  term  “fluid.” 

C.  Temperature  and  pressure  following  femtosecond  excitation 

For  F>F,„,  the  quasiequilibrium  temperature  Tf  of 
the  liquid  established  by  A/  ~  1  ps  can,  in  principle,  be 
calculated  by  a  generalization  of  Eq.  (1)  which  takes  into 
account  variations  in  the  absorption  coefficient  a(r)  dur¬ 
ing  the  pump  pulse,  and  the  heat  capacity  ^,^^^(7)  of 
the  liquid  phase.  In  practice,  the  variations  in  all)  are 
quite  complicated  and  C|,^|mj(  'F)  can  only  be  estimated, 
thus  precluding  any  more  than  r.  rough  estimate  of  T,  of 
the  liquid.  The  significant  saturation  (decrease  in  a) 
which  occurs  at  fluences  approaching  F„  for  HOPG  has 
already  been  noted.  On  the  other  hand,  above  F„,  the 
onset  of  melting  and  metallic  properties  probably  in¬ 
creases  a  later  in  the  pump  pulse.  Thus  as  a  compromise 
the  linear  absorption  depth  a  ’=300  A  of  HOPG  pro¬ 
vides  an  approximate  basis  for  estimating  Tf.  On  the 
short  lime  scale  of  A/  ~  1  ps,  volume  expansion  can  be 
neglected  {f)=p„~  2. 25  g/cm  *  for  HOPG),  and  the  heat 
capacity  C,,  at  constant  volume  should  be  used.  The 
specific  heat  (3/2)iVkp  of  a  classical  monatomic  gas  pro¬ 
vides  a  basis  for  estimating  for  C|„,,„j(  T).  Internal  vibra¬ 
tional  modes  in  the  liquid  would,  of  course,  increase  this 
value  toward  the  Dulong-Pctit  value  of  3.VA.„,  The  cu.i 
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sured  pump  se!f-reflectivities  R(F„)  in  Fig.  3(a)  can  be 
used  directly  in  Eq.  1).  Combining  these  approxima¬ 
tions,  the  temperature  estimates  on  the  upper  horizontal 
axis  of  Fig.  8  were  obtained,  and  indicate  that  tempera¬ 
tures  as  high  as  100000  K  can  be  reached  under  our  ex¬ 
perimental  conditions.  The  expected  accuracy  of  this 
scale,  however,  is  no  better  than  ±50%  in  view  of  the 
above  approximations. 

When  the  sample  is  heated  and  probed  on  a  time  scale 
shorter  than  the  characteristic  time  for  thermal  expan¬ 
sion,  transient  pressure  exists  because  the  sample  is  iner- 
tially  confined  to  a  constant  volume.  The  magnitude  of 
the  pressure  depends  on  the  degree  of  equilibration 
within  the  phonon  system.  As  an  upper  limit  which  as¬ 
sumes  an  equilibrated  phonon  system,  the  pressure  p  of  a 
solid  heated  to  /cg  T  <  10  eV  (assuming  —  T,  =  T)  at  the 
constant  density  p,  of  the  low  temperature  solid  is  deter¬ 
mined  primarily  by  anharmonic  ionic  motion,  and  is  de¬ 
scribed  by  the  appro.ximate  formula  p  ~p,~~3NykgT, 
where  y  is  the  Gruneisen  parameter.*^  More  precisely, 
total  pressure  p  also  includes  contributions  from 
Thomas-Fermi  electron  pressure,  and  a  correction  for 
chemical  bonding,  but  for  kgT  <\0  eV  and  p~p,,  the 
ionic  contribution  p,  strongly  dominates.'”  The  dashed 
curves  in  Fig.  10  plot  p,a„iT)  for  liquid  carbon  for  fixed 
densities  corresponding  to  graphite  (2.25  g/cm^)  or  dia¬ 
mond  (3.5  g/cm^).  The  solid  curves  show  p  (  T)  corrected 
for  the  electronic  and  bonding  contributions  [derived 
from  Eqs.  (81)  and  (106)  of  Ref.  44].  For  completeness, 
the  curves  have  been  extended  to  T  <T,„,  using  the  cor¬ 
responding  expression  for  p,^„{p,T)  for  a  high  tempera¬ 
ture  solid  [Eq.  34(b)  of  Ref  44].  Figure  10  shows  that 
transient  pressures  of  approximately  1  megabar  can  exist 
at  T>T„.  The  time  scale  r„|  for  release  of  the  pressure 
is  estimated  from  the  ratio  of  the  heated  (absorption) 
depth  a  ‘  to  the  speed  of  sound  c, .  For  HOPG,  we  esti¬ 
mate  a  ~ ' /Cj  ~  5  ps,  using  a  '  ~500  A  and  c,  ~  lO*" 
cm/s.  This  estimate  confirms  that  solid  density,  and  thus 
elevated  pressure,  still  exist  at  At  ~  1  ps,  and  simultane¬ 
ously  supports  the  argument  that  the  reflectivity  decrease 
observed  at  At  >  5  ps  corresponds  to  the  expansion  of  ini- 


FIG.  10.  Effective  pressure  of  carbon  sample  following  con- 
stam  \  olume  heating  at  density  of  graphite  (2.25  g/cm')  or  dia¬ 
mond  '3.5  g/cm’l,  calculated  using  approximate  equation  of 
state  formulas  described  in  Ref.  44.  Dashed  curves  include  the 
dominant  ionic  contribution  only;  solid  curves  include  both  ion¬ 
ic  and  Thomas-Fermi  electronic  contributions  to  pressure. 


tially  pressurized  material.  Rothenberg"*^  has  measured 
such  a  pressure  release  time  in  picosecond-irradiated 
solid  silicon  using  time-resolved  photothermal  deflection 
spectroscopy. 


V.  DRUDE  DIELECTRIC  FUNCTION 
OF  LIQUID  CARBON 


We  now  analyze  in  detail  the  measured  optical  proper¬ 
ties  of  the  fluid  state.  This  analysis  is  compared  directly 
to  optical  properties  predicted  by  molecular  dynamics 
simulations,  and  to  pulsed  electrical  measurements  of  dc 
conductivity  of  liquid  carbon. 

The  dielectric  function  of  most  liquid  metals  is  well  de¬ 
scribed  by  a  Drude  function;^ 


e(ra)  =  c,j  — 


(O 


V 


1  +0)^1^ 


(4) 


Success  in  fitting  the  optical  properties  of  the  related 
column  IV  liquid  metals  Si,^^  ”  Ge,”  and  Sn,'*^  with  this 
model  especially  motivates  its  application  to  liquid  car¬ 
bon.  Usually  the  polarizability  (f,^)  and  residual  inter¬ 
band  absorption  (e,^. )  of  the  ionic  cores  can  be  neglected 
(€|,  =1,  e,^=0l,  leaving  the  free  electron  plasma  fre¬ 
quency  (Op=^TTn^.e- /m  (rt^=electron  density,  m  =free 
electron  mass)  and  relaxation  time  r  as  the  dominant  pa¬ 
rameters  which  determine  surface  reflectivity.  We  shall 
follow  this  simplifying  assumption  initially,  then  later  ex¬ 
amine  the  effect  of  introducing  finite  core  dielectric  pa¬ 
rameters.  Full  determination  of  the  Drude  parameters 
(Op  and  T  requires  at  least  two  independent  measurements; 
measurements  at  steep  incidence  angle  usually  provide 
the  greatest  precision  in  determining  e{(o).  Our  measure¬ 
ments  can  be  summarized  in  two  categories:  (1)  the  nor¬ 
mal  incidence  reflectivity  spectrum  Rihv)  over  the  range 
1.771hv54.0  eV  (see  Figs.  6-8)  and  (2)  oblique  in¬ 
cidence  16  =  50°)  s-  and p-polarized  reflectivity  at  hv  =  2.0 
eV  [e.g..  Figs.  4(a)  and  4(d)].  Measurements  at  At  ~  1  ps 
at  each  of  several  pump  fluences  are  used  for  the  analysis. 
Any  choice  of  Drude  parameters  must  be  consistent  with 
all  measurements  for  a  given  pump  fluence. 

Figure  1 1  plots  the  normal  incidence  reflectivity  spec¬ 
trum  at  At  ~  1  ps  for  two  fluences  (large  solid  squares; 
0.5  J/cm’;  open  squares:  2.0  J/cm‘),  as  gleaned  from  the 
data  in  Figs.  6-8,  This  data  is  compared  to  the  measured 
reflectivity  spectra  of  solid  graphite  (solid  curve),  dia¬ 
mond  (crosses),  and  liquid  silicon  (open  circles).  The 
measured  reflectivity  decreases  monotonically  with  in¬ 
creasing  photon  energy  hv,  and  equals  or  exceeds  the 
reflectivity  of  the  corresponding  solids  throughout  the 
probed  frequency  range.  At  visible  frequencies  the  mea¬ 
sured  reflectivity  increases  with  pump  fluence  up  to  a 
maximum  of  about  0.5,  although  at  hv  =  4.0  eV, 
reflectivity  was  independent  of  fluence  within  experimen¬ 
tal  error.  The  first  column  of  Fig.  12  plots  the  measured 
s-  and  p-polarized  oblique  incidence  reflectivity  at 
hv=2.0  eV,  At  -  1  ps,  at  F  -0,5  J/cm’,  compared  to  the 
normal  incidence  reflectivity  at  the  same  probe  wave¬ 
length. 

The  remaining  curves  in  Figs.  1 1  and  12  are  theoretical 
models  of  the  reflectivity  spectrum,  which  we  now  discuv>. 
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FIG.  n.  Normal  incidence  reflectivity  spectra  of  crystalline 
diamond  (Ref.  27)  (crosses);  crystalline  pyrolytic  graphite  (Ref. 
26)  (solid  curse);  carbon  measured  at  At  I  ps  after  a  fem¬ 
tosecond  melting  of  graphite  with  pulse  of  0.5  J/cm’  ilarge  solid 
squares!  or  2.0  J/cm’  (large  open  squares);  liquid  carbon  calcu¬ 
lated  using  Drude  parameters  =  .1. 5  X  10'*'  s 
r  =  0.25X10  s  (dotted  curve!  and  =  1.6X  10"'  s 
7  =  0.9X10  "  s  'dashed  curve);  liquid  carbon  calculated  from 
molecular  dynamics  simulation  at  atmospheric  pressure  (Ref.  6) 
ismall  solid  squares!;  liquid  silicon  measured  by  ellipsometric 
methods  (Ref.  25'  (open  circles). 


further.  The  small  solid  squares  in  Fig.  1 1  represent  a 
reflectivity  spectrum  R(cu)  extracted  from  the  conduc¬ 
tivity  function  ct(cu)  derived  by  Galli  ct  al.’'  from  a 
molecular  dynamics  simulation  of  liquid  carbon  [see  Fig. 
2  of  Ref.  6(3)]  for  the  following  conditions:  low  pressure 
P-~  \  atm,  =r,  =5000  K,  density  p  =  2.0  g/cm'.  To 
obtain  this  RUo),  Imcr(w)  was  derived  by  Kramers- 
Kronig  transformation  of  Rerrlw)  given  by  Galli  et  al.;" 
then  f(w),  expressed  in  terms  of  complex  conductivity, 


FIG.  12.  Oblique  (W=  50’)  incidence,  s-  and  p-polari/cd,  and 
normal  <0  0‘)  incidence  reflectisits  of  liquid  carbon  at  photon 

energy  2  eV,  'a'  measured  at  A(  -  I  ps  after  a  femtosecond 
niellmg  of  graphite  with  pulse  of  0.5  J/cm  :  h'  calculated  from 
molecular  dynamics  simulation  at  atmospheric  pressure  'Ref  6i; 
'C!  calculated  using  Drude  parameters  <  >,  =  .t  5  x  10''  s 
r  f)  2,  0.25.  0  .Vx  10  s;  idi  calculated  using  Drude  parame¬ 
ters  fc,  1 . 6  x  10  '  s  r  0  X,  0 I,()X  10  "  s. 


was  used  to  derive  R  icj).  Alternatively,  a  nearly  identi¬ 
cal  /?(aji  was  obtained  over  the  frequency  range  of 
our  data  by  fitting  Recrlw)  to  a  Drude  function 
a>~r/4-i  1  ),  then  substituting  the  resulting  best 

fit  Drude  parameters  =  3. 5  X  lO'*’  s  '  and  r  =  0. 67 
XIO*'*’  s  ',  into  the  dielectric  function  Eq.  (4)  with 
6i,.  =  1,  e2.  Physically,  the  best  fit  oi^  corresponds  to 
n ,  =  3. 8  X  10"'  cm  ’  (assuming  a  free  electron  mass),  very 
close  to  four  times  the  atomic  density  n„  =  1  X  lO"’  cm 
corresponding  top  =  2. 26  g/cm',  implying  a  nearly  qua¬ 
drivalent  metal.  The  best  fit  r  implies  a  mean-free  path 
l=i/^r=1.8  A  (L7.  =  Fermi  veltxiity  =2.6X10"  cm/s), 
close  to  the  average  interatomic  spacing  '^'  =  2,1  A. 

The  second  column  of  Fig.  12  shows  the  oblique  in¬ 
cidence  reflectivity  calculated  from  Fresnel’s  equations, 
using  the  same  dielectric  function. 

The  calculated  reflectivity  is  approximately  50‘7F 
higher  than  the  observed  reflectivity  (Figs.  11  and  121. 
well  outside  of  experimental  inaccuracy.  We  therefore 
consider  possible  causes  of  this  significant  discrepancy. 
One  possibility  is  that  core  dielectric  terms  must  be  in¬ 
cluded  along  with  the  best  fit  Drude  parameters.  Howev¬ 
er.  introducing  a  core  polarizability  as  large  as  that  of 
solid  graphite  (e^  =4.5)  together  with  the  best-fit  Drude 
parameters  cited  above  yields  RI2  eV)  =  0.58  and  Rl4 
eV|  =  0.45,  still  well  above  the  observed  values.  We  have 
investigated  systematically  combinations  of  core  dielec¬ 
tric  terms  in  the  ranges  1  S  fi,  5  4, 5  and  0  5  e,,  f  4.0,  and 
find  generally  that  core  dielectric  terms  cannot  explain 
the  discrepancy  shown.  We  most  conclude  that  the 
Drude  parameters  themselves  are  different  from  the 
values  extracted  from  the  simulations  of  Galli  ct  a/.,*'  or 
that  the  Drude  function  itself  does  not  adequately  de¬ 
scribe  the  properties  of  carbon  1  ps  after  photoexcitation. 
Close  agreement  should  not  necessarily  be  expected,  be¬ 
cause  our  experiment  probes  a  sample  at  high  pressure, 
and  densities  close  to  those  of  the  initial  solids  (graphite: 
2.26  g/cm’;  diamond:  3.5  g/cm').  The  discrepancy  pro¬ 
vides  motivation  for  further  simulations  of  the  electronic 
properties  of  liquid  carbon  under  conditions  closer  to 
those  of  the  experiment.  In  fact,  a  molecular  dynamics 
simulation  at  high  pressure  has  been  reported,"  '”  but  to 
our  knowledge  electrical  properties  were  not  derived.  In 
the  current  absence  of  such  simulations,  we  consider 
what  revisions  in  the  dielectric  function  can  better  ex¬ 
plain  the  reflectivity  data. 

The  dotted  curve  in  Fig.  1 1  shows  a  greatly  improved 
fit  to  R(<ij)  at  F  =  0.5  J/cm'  obtained  by  reducing  the 
value  of  7  from  0.67  to  0.25X10  s,  while  retaining 
other  parameters  (w^  =  3.5Xl0"’  s  '.  Cj,  =1,  e,^=0i 
corresponding  to  a  simple  nearly  quadrivalent  Drude 
metal.  The  third  column  of  Fig.  12  shows  that  the  ob¬ 
lique  incidence  reflectivity  at  /•’=0. 5  J/cm'  is  also  e.\- 
plained  almost  perfectly  by  this  dielectric  function,  and 
that  slightly  larger  (0.3 X  10  s)  or  smaller  (0. 2 X  10  ' 

s)  values  of  7  significantly  worsen  the  fit.  Nevertheless 
this  fit  is  not  unique.  Ihc  dashed  carve  in  Fig.  1 1  and  the 
fourth  column  of  Fig,  12  show  that  a  fit  of  neaily  equal 
quality  is  obtained  by  decreasing  o>^  to  1.6X10'"  s  . 
corresponding  to  a  monovalent  i/i  -  1  X  10''  cm  )  nu'- 
al.  and  increasing  7  to  0.9,X  10  s.  In  fact  a  family  ( 
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i(Op,T)  pairs  in  between  these  two  extremes,  plotted  as  a 
line  in  the  {cOp,T)  plane  in  Fig.  13(a),  provide  satisfactory 
fits  to  our  reflectance  data  for  F==0. 5  and  2  J/cm’.  Fits 
of  similar  quality  are  obtained  for  the  data  at  other 
fluences.  Thus  the  individual  Drude  parameters  are  not 
uniquely  determined  by  our  current  data.  On  the  other 
hand,  the  line  in  Fig.  13(a)  determines  the  product  o>pT 
and  the  dc  resistivity  p^^~4Tr/cOpT  to  within  relatively 
narrow  ranges,  as  shown  in  Fig.  13(b)  (e.g.,  1  <6jpr<  1.6 
and  400<pjg<500  ^Hcm  for  0.5  J/cm^).  Finally  Fig. 
13(c)  shows  the  dependence  of  the  optically  determined 


FIG.  13.  (a)  Family  of  Drude  parameters  and  r  which 
provide  satisfactory  fils  to  carbon  reflectance  data  at  Ar  —  I  ps 
after  femtosecond  melting  pulse  at  0.5  and  2.0  J/cm’;  lb»  dc 
resistivity  values  4-A.>:r  corresponding  to  family  of  Drude  pa¬ 
rameters  in  (a);  Ic)  dependence  of  optically  determined  resistivi¬ 
ty  on  pump  fluence,  showing  four  different  data  analyses  assum¬ 
ing  oj..  constant  at  either  1.6X10"’  s  '  (open  circles)  or 
3,5X10''’  s  '  (open  squares  and  r  variable  with  increasing 
fluence;  or,  conversely,  r  constant  at  either  0.  3  X  10  s  (solid 
squares)  or  l.OX  10  s  (solid  circles),  and  oj,  variable  with  in¬ 
creasing  fluence. 


Pd,.  on  pump  fluence.  Because  and  t  may  individually 
vary  with  fluence  in  a  range  of  possible  ways,  four  plots 
of  Pj^.  illustrating  the  extreme  cases  are  shown  in  Fig. 
i3(c):  dip  constant  (at  either  1.6  or  3.5X10’*  s  ')  and  r 
variable  with  increasing  fluence;  or,  conversely,  r  con¬ 
stant  (at  either  0.3  or  l.OX  10“  '*  s),  and  Wp  variable  with 
increasing  fluence.  In  all  cases,  p^^.  decreases  from 
625±75  fifl  cm  slightly  above  the  melting  fluence  to 
375±75  pflcm  at  tenfold  higher  fluence.  For 
0. 13<f<0. 3  J/cm^,  the  reflectance  data  was  first 
corrected  for  the  incomplete  overlap  of  the  melted  area 
with  the  probe  spot,  using  the  data  in  Fig.  1.  The  slight 
increase  in  p  for  F  <0.25  J/cm^  fails  within  experimental 
error.  The  trend  in  p  contrasts  with  the  usual  increase  of 
p  with  temperature  in  metals,  but  follows  the  usual  de¬ 
crease  of  p  with  pressure,  as  discussed  further  below. 

Table  I  compares  our  optically  derived  Drude  parame¬ 
ters  and  dc  resistivity  with  optical-derived  parameters  of 
the  other  column  IV  liquid  metals  (measured  at  atmos¬ 
pheric  pressure  slightly  above  their  respective  melting 
points),  and  with  pulsed  electrical  measurements  and 
theoretical  simulations  for  liquid  carbon.  Our  product 
iDpT—  \  is  significantly  smaller  than  for  the  other  column 

IV  liquid  metals,  indicating  an  unusually  strongly 

damped  electron  plasma  and  strong  electron  localization. 
Similarly,  the  optically  determined  is  significantly 
higher  than  those  of  the  other  metals.  The  /-C  values  are 
consistent  qualitatively  with  the  trends  of  decreasing  ojpT 
and  increasing  p^^.  as  atomic  size  decreases  in  column  IV. 
The  unusually  large  size  of  the  changes  between  /-Si  and 
/-C  is  consistent  with  the  much  higher  temperature  re¬ 
quired  to  melt  carbon,  which  in  turn  strongly  reduces  the 
electron  collision  time  r.  Nevertheless,  the  unusually 
small  numerical  values  of  r  (0.3  <  r  <  l.OX  10“  '*  s)  for  /- 
C  compared  to  optically  determined  r  values  in  /-Si 
(2. 1  X  10  ">  .s),  /-Ge  (2.5X  10  '*  s),  and  /-Sn  (3.5X  10”  '* 
s)  warrant  further  comment,  since  they  imply  electron 
mean  free  paths  l  —  Vf-r  in  the  range  0. 8</<1.7  A, 
which  are  actually  somewhat  smaller  than  the  average  in¬ 
teratomic  spacing  (/V  '^^~2  A).  should  set  an 

approximate  lower  limit  on  the  path  between  electron 
scattering  events,  leading  to  “saturation"  of  p^  and  r 
with  increasing  temperature,'*'*  as  has  been  observed  in 
aluminum  under  short  pulse  heating  conditions.*’  A  pos¬ 
sible  physical  mechanism  yielding  /  slightly  less  than 
N  '  *  might  be  collisions  within  localized  carbon  mole¬ 
cules  containing  small  angle  bonds.  Alternatively,  a  re¬ 
sidual  population  of  anomalously  hot  electrons  with 

V  >  Vy  would  have  mean-free  ,„.ths  larger  than  the  range 
indicated  above.  On  the  other  hand,  in  view  of  the  phe¬ 
nomenological  nature  of  the  Drude  model,  the  precise 
value  of  7  should  probably  not  be  interpreted  too  literal¬ 
ly,  as  long  as  VyT  saturates  in  the  vicinity  of 
Indeed  the  fluence  dependence  shown  in  Fig.  13(c)  is  fully 
consistent  with  a  saturated  resistivity  above  the  melting 
fluence,  since  resistivity  does  not  increase  as  fluence  in¬ 
creases  above  F,„  (with  the  possible  exception  of  a  slight 
increase  in  a  narrow  range  just  above  F,„  ).  In  fact,  rather 
than  leveling  off,  resistivity  actually  decreases  with  ap¬ 
proximately  F  dependence.  The  decrease  in  p  prob- 
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TABLE  I.  Summary  of  measurements  and  calculations  of  Prude  parameters  and  dc  resistivity  in  column  IV  liquid  metals. 


Liquid 

Pressure  (bars) 

Temperature  (K) 

ciij,r 

Method 

Reference 

Sn 

1 

1130 

8 

60 

cw  ellipsometry 

Hodgson  (Ref.  47) 

Ge 

1 

1270 

6 

75 

cw  ellipsometry 

Hodgson  (Ref.  47) 

Si 

1 

1800 

5.6 

80 

cw  ellipsometry 

Shvarev  et  at.  (Ref.  43) 

1 

1800 

5.3 

85 

Picosecond 

Li  and  Fauchet  (Ref.  25) 

reflectance 

C 

'0.3X10* 

-6000 

1-1.6 

625±75 

Femtosecond 

This  work 

reflectance 

4X  10^ 

-5000 

(000 

Ohmic  heating 

Gathers  et  al.  (Ref.  14(a)] 

1 

-4500 

30-70 

Ohmic  heating 

Heremans  et  al.  (Ref.  4) 

1 

5000 

2.3 

140128 

Molecular  dynamics 

Galli  et  al.  [Ref.  6(a)] 

simulation 

ably  has  multiple  causes,  including  increasing  ionization 
(i.e.,  increasing  oi^,  even  at  constant  t),  and  an  approach 
to  high  temperature  plasma  behavior,  in  which  resistivity 
is  increasingly  determined  by  Coulomb  scattering,’^  and 
scales  as  T 

The  range  of  Drude  parameters  cited  above  corre¬ 
sponds  to  a  range  of  possible  electronic  and  molecular 
structures.  The  smaller  oip  (1.6X10**  s“')  corresponds 
to  a  monovalent  metal,  implying  that  three  of  the  four 
valence  electrons  of  carbon  are  involved  in  covalent 
bonding.  Indeed  the  molecular  dynamics  simulation  of 
Galli  et  al.,^  although  consistent  with  a  quadrivalent 
electronic  structure  at  low  pressure,  nevertheless  shows 
prevalent  clusters  and  chainlike  molecular  structures  in 
the  liquid  state.  On  the  other  hand,  the  other  column  IV 
liquid  metals  are  all  quadrivalent  metals.^’  "*’  The  current 
results  provide  no  firm  basis  for  distinguishing  between 
these  two  pictures,  or  an  intermediate  version.  Molecular 
dynamics  simulations  of  the  electronic  properties  at 
elevated  pressure,  together  with  extended  femtosecond  el- 
lipsometric  measurements  at  very  steep  probe  incidence 
angles  and  vacuum  ultraviolet  probe  frequencies  will  nar¬ 
row  the  uncertainty  considerably,  and  possibly  detect 
variations  from  Drude-like  optical  properties.  Whichev¬ 
er  of  these  interpretations  proves  ultimately  correct,  the 
current  experiments  show  clearly  that  liquid  carbon  is  a 
resistively  saturated  material,  for  which  the  “free  elec¬ 
tron”  Drude  model  and  the  attribution  “metal”  are  at  the 
limits  of  their  validity. 

Comparison  of  our  optically  determined  resistivity 
with  pulsed  electrical  heating  measurements  of  liquid  car¬ 
bon  (see  Table  I)  also  provides  physical  insight.  The  mea¬ 
surements  which  most  closely  approximate  our  pressure- 
temperature  conditions  involve  pulsed  ohmic  heating  of 
vitreous*"*"'  or  glassy  carbon*"*'**'  to  temperatures  up  to 
6000  K  under  2-4  kbar  rare  gas  pressure.  Melt  resistivi¬ 
ties  p'-lCXX)  pflcm  were  reported  from  these  measure¬ 
ments,  which  is  very  consistent  with  our  optically  deter¬ 
mined  values  just  above  F„, .  In  fact  the  somewhat  small¬ 
er  values  obtained  in  our  experiment  are  fully  consistent 
with  the  higher  sample  compression  under  femtosecond 
excitation,  which  suppresses  the  amplitude  of  ionic  vibra¬ 
tional  motion  responsible  for  electron  scattering  and 
resistivity.  Indeed  clear  evidence  of  volume  expansion 
was  obtained  in  the  pulsed  ohmic  heating 


measurements,'"*'"’  and  was  taken  into  account  in  deter¬ 
mining  the  resistivity.  On  the  other  hand,  recent  mea¬ 
surements*  and  calculations*  ’  at  ambient  atmospheric 
pressure  have  yielded  much  smaller  resistivities  than  ob¬ 
tained  in  our  experiment.  Heremans  et  al*  reported 
resistivity  30 <p  <  70  pfl  cm  based  on  resistance  measure¬ 
ments  of  ohmically  heated  pyrolytic  graphite  fibers  at  at¬ 
mospheric  pressure.  Two  theoretical  simulations  per¬ 
formed  after  this  experiment  yielded  resistivities  of  140 
fiQ.  cm,*  and  80  pH  cm’.  Volume  expansion  upon  melt¬ 
ing  was  neglected  in  analyzing  the  measured  resistance, 
which,  if  present  to  the  degree  observed  in  Ref.  14(c), 
could  raise  the  reported  resistivity  by  as  much  as  a  factor 
of  2.  Nevertheless,  even  with  this  correction  the  values 
would  still  fall  significantly  below  ours.  The  lower  sam¬ 
ple  pressure  compared  to  our  conditions  is  expected  to 
yield  higher,  rather  than  lower  resistivity  (in  the  absence 
of  phase  transitions).  Thus  other  causes  of  the  discrepan¬ 
cy  with  our  measurements  must  be  sought.  One  possibili¬ 
ty  is  that  the  nature  of  the  material  is  very  different  at  at¬ 
mospheric  pressure.  Indeed,  a  number  of  independent 
experiments  in  which  graphite  was  melted  in  atmospheres 
of  pressurized  inert  gases  have  agreed  that  the  solid- 
vapor-liquid  triple  point  occurs  at  ~110  atm. 
pressure.’'"'-’*  Thus  according  to  these  investigators,  an 
equilibrated  liquid  phase  of  carbon  does  not  even  exist  at 
atmospheric  pressure.  It  is  therefore  conceivable  that  at¬ 
mospheric  pressure  experiments  measure  a  qualitatively 
different  phase  of  carbon  than  high  pressure  experiments. 
Reconciliation  of  the  various  pulsed  ohmic  heating  exper¬ 
iments  is  complicated  by  their  use  of  different  starting 
forms  of  carbon,  and  by  the  lack  (to  our  knowledge)  of 
measurements  on  a  single  type  of  starting  sample  at  pres¬ 
sures  both  above  and  below  1 10  atm.  In  addition,  the  re¬ 
ported  electrical  resistivity  of  the  heated  phase  sometimes 
depends  on  prior  history  (e.g.,  heat  treatment)  of  the 
starting  solid.  These  ambiguities  would  be  clarified  by 
performing  ohmic  heating  experiments  on  well- 
characterized  pyrolytic  graphite  fibers,  glassy  carbon,  vit¬ 
reous  carbon,  and  perhaps  other  solid  carbon  forms  at 
systematically  varied  pressures  ranging  from  1  bar  to 
several  kbar.  In  summary,  our  optically  determined 
resistivity  agrees  very  well  with  pulsed  electrical  measure¬ 
ments  reported  for  carbon  samples  melted  at  pressures 
well  above  the  widely  accepted  triple  point  pressure  of 
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110  but  significantly  exceeds  reported  measure¬ 

ments*  and  calculations*’  ^  of  resistivity  for  low  pressure 
samples. 

VI.  CONCLUSIONS 

Femtosecond  melting  experiments  measure  the  optical 
and  electronic  properties  of  refractory  condensed  materi¬ 
als  in  quantitative  detail  at  extremes  of  pressure  and  tem¬ 
perature  inaccessible  by  other  experimental  methods.  We 
have  presented  a  comprehensive  study  of  the  optical 
properties  of  liquid  carbon  produced  by  femtosecond  ex¬ 
citation  of  pyrolytic  graphite  and  diamond  above  critical 
melting  fluences  of  0. 13  J/cm^  and  0.6  J/cm^,  respective¬ 
ly.  All  evidence  suggests  that  both  samples  melt,  reach 
nearly  common  electron-lattice  temperature  and  steady 
state  visible  reflectance  higher  than  the  starting  solids 
within  1  ps,  before  the  release  of  internal  pressure  and  hy¬ 
drodynamic  expansion  distort  the  optical  properties  of 
the  surface.  A  careful  analysis  of  fluence-dependent 
reflectance  at  At  >  1  ps  and  fluence-dependent  damage 
morphology  shows  that  surface  expansion  strongly  at¬ 
tenuates  surface  reflectivity  within  a  few  picoseconds  of 
femtosecond  irradiation  at  all  fluences  above  the  critical 
melting  fluence.  A  Drude  model  fit  of  the  measured 
reflectance  spectrum  at  At  ~  1  ps  shows  liquid  carbon  to 
be  a  resistively  saturated  metal  under  femtosecond  melt¬ 
ing  conditions,  consistent  with  dc  resistivity  measure¬ 
ments  made  on  carbon  samples  melted  by  pulsed  ohmic 
heating  in  an  inert,  pressurized  atmosphere. 

This  work  points  to  a  number  of  areas  for  additional 
research.  The  pressure  dependence  of  the  electrical  prop¬ 


erties  of  liquid  carbon  should  be  clarified  by  theoretical 
simulations  at  high  pressure  and  by  pulsed  electrical  mea¬ 
surements  on  well-characterized  samples  at  systematical¬ 
ly  varied  pressures  between  1  bar  and  several  kbar.  Fem¬ 
tosecond  experiments  can  be  improved  by  using  probe 
pulses  at  vacuum  ultraviolet  wavelengths  and  steep  in¬ 
cidence  angles,  in  order  to  probe  the  plasma  edge  more 
accurately  and  determine  individual  Drude  parameters 
more  precisely  than  has  been  possible  in  this  work. 
Significant  deviations  from  Drude-like  behavior  might 
also  be  detected.  The  degree  of  equilibration  following 
femtosecond  excitation  can  be  further  clarified  by  dynam¬ 
ical  simulations  which  start  with  graphite  and/or  dia¬ 
mond  crystalline  structures,  and  by  more  extensive  com¬ 
parison  of  the  femtosecond  time-resolved  reflectance  of 
less  refractory  materials  with  independently  measured 
optical  properties  of  the  equilibrated  liquid  phases.  Fi¬ 
nally,  in  a  larger  context,  femtosecond  spectroscopy  pro¬ 
vides  a  promising  method  for  optically  characterizing  a 
wide  range  of  condensed,  refractory  materials  in  previ¬ 
ously  inaccessible  pressure  and  temperature  regimes 
characteristic  of  planetary  and  stellar  interiors. 
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We  present  femtosecond  time-resolved  p-  and  s-polari2ed  reflectivity  measurements  of  silver,  aluminum,  and 
tungsten  surfaces  that  are  hydrodynamically  expanding  in  response  to  pulsed  laser  excitation.  A  numerical 
model  of  the  experimental  results  shows  that  (1)  electron  heating  and  surface  expansion  influence  the  reflec¬ 
tivity  on  distinguishable  time  scales  and  (2)  covalency,  described  by  non-Drude  dielectric  constants,  is  neces¬ 
sary  to  explain  the  time-resolved  reflectivity  of  the  tungsten  target. 


Intense  femtosecond  excitation  of  solid  targets  has 
been  used  to  produce  and  measure  the  properties  of 
solid  density  plasmas''^  and  pressurized  liquids^  that 
resemble  states  of  matter  found  in  planetary  and  stel¬ 
lar  interiors.  With  sufficiency  short  pulses,  the  sur¬ 
face  can  be  fully  heated  and  optically  probed  before 
the  surface  hydrodynamically  expands.  In  several 
recent  studies, the  ultrafast  reflectivity  response 
of  highly  excited  metals  was  observed  with  a  single, 
intense  laser  pulse  serving  both  as  target  excitation 
and  as  optical  probe.  In  these  studies,  the  initial 
stages  of  hydrodynamic  expansion  occurred  during 
the  pulse.  Consequently,  competing  processes  of 
electron  heating,  melting,  and  hydroi^namic  surface 
expansion,  which  influence  the  reflectivity  to  differ¬ 
ent  degrees  and  at  different  times,  could  not  be 
clearly  resolved. 

In  this  Letter  we  present  femtosecond  time- 
resolved  experiments  in  which  a  90-fs  visible  pump 
pulse  excites  either  a  free-electron  metal  (Ag,  Al)  or 
d-band  metal  (W)  target  at  intensities  of  as  much  as 
5  X  10'^  W/cm^.  The  front  surface  reflectivity 
Rp,,  (At)  of  a  90-f8,  obliquely  incident  p-  or 
s-polarized  probe  pulse  of  the  same  center  wave¬ 
length  is  then  monitored  as  the  pump-probe  time  de¬ 
lay  At  is  varied  from  -1  to  +20  ps.  The  probe 
Rp_,(At)  data  are  analyzed  at  each  At  by  numerically 
solving  the  Helmholtz  wave  equations.^  The  results 
show  that  (1)  electron  heating,  which  occurs  instan¬ 
taneously  on  surface  excitation,  can  be  temporally 
distinguished  from  the  delayed  surface  hydrody¬ 
namic  expansion  and  (2)  non-Drude  background  di¬ 
electric  constants  are  needed  to  explain  Rp,AAt)  of 
the  expanding  non -free- electron  metal  W  target, 
which  indicates  a  strong  influence  of  covalent 
bonding  in  the  liquid  and  the  expansion  region. 
Through  such  results  we  aim  to  understand  quanti¬ 
tatively  the  optical  properties  of  metal  surfaces  dur¬ 
ing  the  initial  stage  of  plasma  expansion  following 
short-pulse  excitation,  which  in  turn  may  aid  in  test¬ 
ing  hydrotfynamic  codes  used  to  model  inertial  con¬ 
finement  fusion  and  short-pulse  x-ray  generation. 

The  experiments  were  performed  in  air  immedi¬ 
ately  after  the  samples  were  polished,  and  the  initial 


optical  properties  were  determined  with  a  He-Ne 
laser.  A  colliding-pulse  mode-locked  dye  laser  and  a 
four-stage  Nd:YAG-pumped  dye  amplifier  generated 
0.5-mJ,  620-nm,  90-fs  pulses  at  a  10-Hz  repetition 
rate.  The  amplified  pulses  are  split  into  a  pump 
beam,  a  1000-fold  weaker  probe  beam,  and  a  refer¬ 
ence  beam,  which  is  used  for  normalizing  the  effect 
of  pulse  energy  fluctuations.  The  probe  is  incident 
at  70°  from  the  surface  normal  and  is  focused  to  a 
30  ^m  X  60  fjLvn  spot  on  the  sample.  This  steep 
angle  provides  good  sensitivity  to  changes  in  the 
surface  dielectric  function  and  density  gradient. 
The  p-polarized  pump  is  incident  at  45°  and  focused 
to  a  60  fim  x  80  pm  spot  centered  around  the  probe 
spot.  The  -  5-ns  pedestal  of  amplified  spontaneous 
emission  was  measured  to  be  less  than  A%  of  the 
pump-pulse  energy,  well  below  the  threshold  for  pre- 
plasma  formation.  The  sample  was  translated  after 
each  shot  to  ensure  that  each  pump  pulse  excited  an 
undamaged  area.  Each  data  point  was  averaged 
over  20  shots. 

RpAAt)  data  for  the  free-electron  metals  Ag  and  Al 
excited  at  10*^  W/cm^  are  shown  in  the  main  plot  and 
the  inset  of  Fig.  1(a),  respectively.  Rp{At)  decreases 
with  the  pulse-width-limited  fall  time  from  an  initial 
value  of  0.88  (Ag)  or  0.79  (Al)  to  a  final  value  of 
-“0.35  after  excitation.  After  reaching  a  minimum, 
Rp(At)  partially  and  more  slowly  recovers  to  approxi¬ 
mately  60%  of  the  original  value  within  15  ps.  For 
At  >  15  ps,  Rp(At)  levels  off  or,  with  higher  pump  in¬ 
tensity,  begins  to  decrease.  RAAt)  of  identically  ex¬ 
cited  Ag  and  Al,  on  the  other  hand,  changes  only 
slightly  during  the  interval  0  <  Al  <  15  ps.  More¬ 
over  its  near-unity  value  shows  that  nonspecular 
scattering  is  weak  during  this  time  interval.  For 
pump  intensities  of  >10‘®  W/cm^,  RAAt)  decreases 
gradually  for  At  >  10  ps,  which  suggests  the  onset  of 
nonspecular  scattering  as  plasma  nonuniformities 
develop,  but  still  only  slightly  for  At  <  10  ps. 

Rp,AAt)  data  for  the  d-band  metal  W  identically  ex¬ 
cited  and  probed  are  shown  in  Fig.  Kb).  The  inset 
shows  RpiAt)  for  excitation  at  5  x  10^*  W/cm*.  In 
this  case,  solid  W  simply  melts  in  <1  ps,  without  ab¬ 
lating,  into  a  liquid  that  is  more  reflective  than  the 
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Fig.  1.  Time-resolved  reflectivity  of  iat  Ag  and  A1  (inset) 
and  lb)  W  surfaces  pumped  by  p-polarized,  90-fs,  620-nm 
pulses  incident  at  45°  from  the  normal  at  10*^  and 

probed  by  s-polarized  mpen  circles)  or  p-polarized  (filled 
circles)  pulses  (90  fs,  620  nm)  incident  at  70°.  Curves 
represent  theoretical  models  based  on  numerical  solutions 
of  Helmholtz  equations,  as  follows;  (a)  Dotted  curves, 
electron  heating  included,  surface  expansion  neglected; 
dashed  curves,  electron  heating  neglected,  surface  expan¬ 
sion  included;  solid  curves,  electron  heating  and  surface 
expansion  both  included;  long-dashed  curves,  covalent 
model  i.s  polarized),  ib'  Dashed  curves,  free-electron 
model;  solid  curves,  covalent  model;  inset:  p-polarized 
data  at  5  X  10'^  W/cm^ 

solid,  so  Rp  remains  nearly  constant  over  20  ps. 
When  the  excitation  was  increased  to  10”  W/cm^ 
[Fig.  lib),  main  plot],  however,  an  initial  Rp  increase 
to  the  liquid  value  1-0.26)  is  followed  by  a  drop  and 
recovery,  as  observed  in  A1  and  Ag.  /?.,  also  in¬ 
creases  by  approximately  10%  immediately  on  exci¬ 
tation  then  decays  slightly  over  the  15-ps  period  of 
observation. 

We  now  analyze  R^  Jlt)  for  Ag,  Al,  and  W  by  nu¬ 
merically  solving  the  Helmholtz  equations'  at  each 
At  using  a  model  that  includes  (1)  heating  and  subse¬ 
quent  cooling  of  the  electrons  and  (2)  development 
of  a  plasma  gradient  that  expands  at  velocity 
c.xp  —  c,,unrt  normal  to  the  metal  surface.  A  space- 
and  time-dependent  Drude  dielectric  function  e(x)  = 
fifllx)  if-2n(x)  -  Mp^ix>/(dloj  +  ii'(x)]  describes  the 
surface  expansion  region  at  each  time  At  after  exci¬ 
tation,  where  is  the  electron  collision  frequency. 


(Up  =  (n,e^/co^)‘  ^  is  the  plasma  frequency, 
-f-  iv)  is  the  free-electron  contribution  to 
the  dielectric  function  that  is  due  to  intraband  tran¬ 
sitions,  while  fio  -  1  and  €20  describe  the  covalent 
electron  contribution  from  interband  transitions. 

The  optical  properties  of  liquid  Ag  and  AI  are  well 
explained  by  a  free-electron  Drude  dielectric  func¬ 
tion  at  visible  frequencies. °  Thus,  as  a  first  ap¬ 
proximation,  we  set  Cio  =  1  and  620  =  0,  so  that  thex 
dependence  of  e(x)  in  the  expansion  region  is  con¬ 
tained  entirely  in  the  x  dependence  of  the  atomic 
density  ATlx),  as  aip^(x)  and  uix)  are  linearly  propor¬ 
tional  to  Nix).  The  dotted  curves  in  Fig.  1(a)  were 
calculated  based  on  electron  heating  and  cooling 
only,  without  hydrodynamic  surface  expansion,  as 
follows.  First,  the  time  evolution  of  electron  tem¬ 
perature  Tt(/)  was  calculated  a  priori  by  solving  the 
coupled  diffusion  equations.®  As  a  lower  limit,  we 
used  room-temperature  electron-phonon  coupling 
constants  of  3.6  x  10”  W/m^  K  for  Al  and  3.6  x 
10'®  W/m^K  for  Ag.'  The  calculated  peak  T,  is  in¬ 
sensitive  to  a  threefold  increase  of  the  coupling  con¬ 
stant.  In  solving  the  equation,  a  Gaussian  laser 
temporal  profile  provided  the  heat  generation  term 
that  determines  the  rise  time  of  and  the  T,  depen¬ 
dence  of  the  electron  heat  capacity  and  chemical 
potential  was  included®  to  ensure  accurate  descrip¬ 
tion  of  the  transition  from  degenerate  to  Maxwellian 
electrons.  Room-temperature  thermal  conduc¬ 
tivities  were  used.  Ballistic  heat  transport  was 
neglected  here  because  of  the  much  shorter  electron 
mean  free  path  at  high  T^.  These  calculations 
yielded  peak  electron  temperatures  of  10,  14.3,  and 
9.41  eV  for  Al,  Ag  and  W,  respectively,  at  10' '  W/cn»'^ 
Second,  TM)  was  related  to  e(f),  and  therefore  to 
RpAht),  by  using®  pit)  =  Cn,Z^TM)lp>p^  is  the  ion 
density,  Z  is  the  effective  ionization  number),  which 
is  valid  for  kT,.  <  20  eV.  The  parameter  C  involves 
the  structure  factor  and  other  weak  temperature- 
dependent  factors.  However,  C  can  be  regarded 
as  a  temperature-independent  constant  and 
wes  the  sole  parameter  used  to  fit  the  Rp  minima  for 
Al  and  Ag  at  several  excitation  levels.  With  top/w 
equal  to  the  liquid-state  values  [4.52  (Ag)  and  7.31 
(Al)]®  on  melting  and  Z  =  3  (Al)  or  1  (Ag),  the  dotted 
curves  in  Fig.  1  are  generated,  which  correspond  to 
maximum  collision  frequencies  of  vai/w  =  1.88  and 
('ak/w  =  0.87  for  10''®  W/cm^  excitation.  The  ob¬ 
served  Rp  minima  at  several  other  excitation  levels 
of  Al  and  Ag  were  also  consistent  with  this  electron 
heating  analysis  to  within  10%.  Finally,  the  fitted 
value  vai  =  5.7  X  10'®  s  '  is  close  to  the  correspond¬ 
ing  value  V  =  6.1  ±  0.4  X  10'®  s  '  derived  from 
Ref.  1  for  Al  at  T,.  =  10  eV,  These  independent 
checks  all  confirm  that  the  rapid  drop  in  Rp  and  the 
Rp  minima  are  determined  almost  entirely  by  elec¬ 
tron  heating.  However,  the  calculated  electron 
cooling  shown  by  the  dotted  curves  fails  to  explain 
the  slow  observed  recovery  of  Rp  without  invoking 
unreasonably  small  phonon  coupling  constants. 

On  the  other  hand,  the  recovery  of  Rp  can  be  ex¬ 
plained  well  by  introducing  hydrodynamic  surface 
expansion  at  approximately  c,„„nd-  The  dashed 
curves  in  Fig.  1(a)  show  the  calculated  Rp(\t)  values 
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of  A1  and  Ag  by  considering  expansion  alone,  without 
electron  heating.  This  calculation  assumed  that 
the  atomic  density  gradient  function  in  the  expan¬ 
sion  region  follows  the  Riemann  solution  of  the  hydro- 
dynamic  equations N(x)  =  iVo(3/4  - 
for  <  X  <  3ieipL  Expansion  velocities  v„  = 

1.2  X  10®  cm/s  (All  and  0.8  x  10®  cm/s  (Ag),  assumed 
constant  over  the  initial  15  ps  of  expansion,  were 
then  determined  from  the  best  fit  to  the  observed 
recovery  and  agree  well  with  the  acoustic  velocities 
independently  calculated^  with  the  peak  electron 
temperatures  cited  above,  vo  =  (ZkaT,/mJ^‘^  =  1.16 
and  0.36  x  10®  cm/s,  respectively.  Nevertheless, 
this  model  alone  cannot  explain  the  pulse-width- 
limited  fall  time  of  JRp.  However,  when  both  in¬ 
creased  collisionality  and  plasma  expansion  are 
taken  into  account  with  the  same  calculated  vitl/w 
values  and  fitted  ti„p  parameter  cited  above  for  the 
dotted  and  dashed  curves,  a  satisfying  fit  to  RpU) 
of  Ag  and  A1  is  obtained  for  the  entire  interval 
0  <  /  <  15  ps,  as  shown  by  the  solid  curves  in 
Fig.  1(a).  The  final  curves  were  insensitive  to  the 
recovery  dynamics  of  T,(0.  A  small  drop  in  /?,(/) 
near  t  =  0  is  also  expected,  but  with  magnitude 
close  to  the  experimental  noise  level. 

In  the  case  of  W,  if  we  make  the  free-electron  as¬ 
sumption  (i.e.,  cio  =  1,  ejo  =  0)  for  the  liquid  and  ex¬ 
panding  material,  we  find  that  the  postmelting 
reflectivity  drop  and  recovery  shown  in  Fig.  Kb) 
cannot  be  explained  by  a  surface  expansion  model 
like  that  described  above.  For  example,  the  dashed 
curves  in  Fig,  Kb’  show  our  best  electron  heating/ 
surface  expansion  model  under  the  constraints 
eio  =  1  and  ejo  =  0  but  disagree  strongly  with  both 
the  Rp(i)  and  R,(t)  data.  For  these  curves  viOi/tu  = 
3.65  and  Wp/w  =  9.07  were  used  in  order  to  satisfy 
the  initial  reflectivity  values  of  molten  VV  How¬ 
ever,  no  better  fit  could  be  obtained  despite  wide 
variations  in  the  choice  of  parameters.  Under  our 
conditions,  recombination  is  negligible”  in  the  ex¬ 
pansion  region  over  the  initial  15  ps. 

To  obtain  a  satisfactory  fit  of  the  W  data,  covalent 
background  dielectric  constants  Cio  >  1  and  cao  >  0 
must  be  introduced  in  both  the  liquid  and  expansion 
regions.  The  solid  curve  in  Fig.  Kb)  shows  a  sati' 
fying  example  of  such  a  fit  to  both  Rpit)  and  RJO 
data,  with  parameters  as  follows.  For  the  liquid 
surface,  €<0  =  7.65,  €20  =  12.3,  v/tu  =  0.7,  and  wp/w  — 

4.3  were  introduced  because  they  are  consistent 
simultaneously  wuh  the  initial  maximum  liquid  W 
reflectivity  and  the  known  resistivity'*  of  W  near 
the  melting  point.  The  fit  of  subsequent  evolution 
then  assumed  this  liquid  ?s  an  initial  condition. 
Continued  heating  of  the  liquid  drives  v/w  to  a  maxi¬ 
mum  value  of  1.8  at  the  Rp  minimum,  then  electrons 
cool  in  —3  ps.  The  observed  Rp  minima  imply  that 
the  structure  parameters  for  the  three  metals  are  in 
the  ratio  Cai  C’  g  Cw  =  0.99:1.1:1.0  with  reasonable 
liquid-state  parameters  Z  =  1.7  and  the  effective 
optical  mass  m,‘  =  2.0m,  [m,*/m,  =  1  for  Al  and  Ag 
I  Ref.  5)].  In  the  expansion  region  (x  >  -Vot),  cio 
and  €20  decrease  with  material  density,  approach.ng 
the  vacuum  values  (en  1,  €20  "=  0)  far  from  the 
sunace.  We  made  the  simplifying  assumption  that 


(€so  -  1)  and  di  crease  linearly  with  density  to¬ 
ward  the  vacuum  values.  The  best  fit,  shown  by  the 
solid  curve  in  Fig.  Kb),  was  then  obtained  with 
=  0.33  X  10“  cm/s,  which  is  the  same  as  the  c;.!- 
culated  value  v„.  More  rigorous  models  of  the  den¬ 
sity  dependence  of  Sjo  and  e^o  would  necessitate 
modest  changes  in  all  parameters.  Nevertheless, 
this  simplified  model  demonstrates  the  influence  of 
covalency  both  in  the  liquid  and  the  expansion  region 
of  W'  Similar  conclusions  are  reached  from  experi¬ 
ments  with  other  covalently  bonded  materials  (e  g  , 
carbon,  silicon). 

The  slight  deviation  of  liquid  Ag  and  Al  (Ref  5) 
optical  properties  from  a  free-electron  dielectric 
function  can  also  be  incorporated  easily  into  the 
model.  Optica]  data®  justify  tw  =  1.3,  €20  ~  12,  and 
u/u>  =  0.085  for  Ag  and  <10  =  1.2,  tju  --  1.1.  and 
v/(A>  =  0.31  for  Al  at  620  nm,  which  yield  the  dashed 
curves  for  R.(t)  in  Fig.  Kai.  The  Rpiti  fits  hardly 
deviate  from  the  previous  ones,  but  the  subunity  val¬ 
ues  of  R,it)  are  better  explained  For  ail  the  best 
ftp,  fits,  the  data  begin  to  deviate  from  the  model 
for  /  >  15  ps.  This  discrepancy  may  be  caused 
by  the  onset  of  nonspecular  scattering,  as  shown  by 
recent  time-resolved  measurements"  from  femto¬ 
second  photoexcited  GaAs  that  show  the  scattering 
rising  sharply  for  >  15  ps.  Thus  the  surface  ex¬ 
pansion  model  based  on  the  Helmholtz  equations 
appears  valid  only  in  a  limited  time  interval 
0  <  At  <  15  ps 
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Electroma^etic  plaama  computer  limulatiooa  are  ueed  to  analyze  the  frequency  ahifu  caused  by  the  ionization 
of  atmoepheric-denaity  noble  gaaea  during  interaction  with  intcnae  femtoaecond  laser  pulse* ,  the  results  are 
presented  and  compared  with  experimental  data.  The  simulations  trace  the  temporal  evolution  of  plasma 
growth  during  the  femtosecond  ionizing  pulse  and  calculate  the  resulting  self-induced  blue  shift  of  the  ionizing 
pulse  spectrum.  Variations  with  pulse  intensity,  gas  pressure,  and  gas  species  are  calculated  The  relative 
contributions  of  strong-field  ionization  and  electron-impact  ionization  on  the  frequency  shifts  are  discussed 
The  simulations  provide  qualitative  explanations  of  most  of  the  features  observed  experimentally  in  the  blue- 
shifted  spectra.  The  technique  of  spectral  blue  shifting  intense  femtoaecond  laser  pulses  provides  a  new  diag¬ 
nostic  tod  for  studying  strong-ndd  ionization  and  laser-induced  breakdown  in  dense  plasmu. 


1.  INTRODUCTION 

Ultrashort-pulse  lasers  with  intensities  greater  than 
10“  W/cm’  have  been  clearly  demonstrated.  These  lasers 
make  possible  the  ionization  of  small  regions  of  gas  in  a 
time  of  the  order  of  the  pulse  duration.  The  rapid  cre¬ 
ation  of  the  plasma  by  strong-field  ionization  is  accompa¬ 
nied  by  a  fast  decrease  in  the  index  of  refraction  of  the 
medium.  The  incident  laser  beam  is  thus  transmitted 
through  a  region  of  rapidly  varying  refractive  index, 
which  causes  a  frequency  upshift  in  the  laser  pulse.  Such 
field-ionization-induced  blue  shifts  were  observed  in 
the  experiments  of  Wood  and  co- workers'"*  with  100-fa, 
620-nm,  10“  W/cm’  laser  pulses. 

Even  for  moderate  laser  intensities  (10“-10'*  W/cm*)  it 
is  known  that  the  laser  frequency  can  shift  as  a  result  of 
the  variation  of  the  refractive  index  of  the  forming 
plasma.  Yablonovitch  and  Bloembergen  observed  spectral 
broadening  and  frequency  shifts  in  laser-breakdown  plaa- 
mas.*"*  In  their  case  plasma  formation  is  caused  by  ava¬ 
lanche  ionization  resulting  from  electron  impact.  More 
recently,  because  of  the  availability  of  lasers  with  intensi¬ 
ties  of  10“  Wcm*  and  above,  Wilks  et  al.*  and  Gildenburg 
ei  ai.“  suggested  that  the  mechaniam  of  strong-field  ion¬ 
ization  of  the  atoms  of  a  gas  can  be  used  for  effective  con¬ 
trol  of  the  frequency  upshifts  of  the  laser  radiation. 
These  field-ionization-induced  blue  shifts  were  verified  in 
the  experiments  of  Wood  and  co-workers'"*  in  the  noble 
gases.  Yablonovitch"  and  Downer  et  al.'^  also  explain 
that  such  blue  shifts  are  implied  in  the  above- threshold 
ionization  experiments  of  Freeman  et  al.'*  For  a  differ¬ 
ent  frequency  range,  Joshi  et  oi.'*  demonstrated  the  fre¬ 
quency  upshifting  of  microwave  radiation  by  laoer-ionized 
plasmas. 

The  frequency  shift  can  be  approximated  by  the  Drude 
model"  as 
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C  J  dt 

where  wo  is  the  original  frequency,  c  is  the  speed  of  light 
in  vacuum,  L  is  the  interaction  length,  nlx.tj  is  the  inde> 
of  refraction  along  the  propagation  direction  i. 

u>,  is  the  plasma  frequency,  and  y  is  the  collisional  fre¬ 
quency.  This  model  provides  a  simple  scaling  formula  for 
the  frequeiKy  shift  of  the  transmitted  laser  beam.  Ac 
cording  to  this  model,  large  frequency  shifts  can  be  ob¬ 
tained  by  simply  increasing  the  interaction  length  or  the 
gaa  density.  The  creation  of  intense  fields  can  currently 
be  accomplished  only  through  tight  focusing  of  the  laser 
beam;  therefore  long  interaction  lengths  may  be  difficult 
to  achieve  because  of  diffraction  effects  Spatial  gradi¬ 
ents  of  the  induced  index  serve  to  limit  this  interaction 
length  further  by  defocusing  the  laser  beam  The  high 
gaa  pressures  at  which  the  critical  density  of  free  elec¬ 
trons  is  achieved  would  limit  the  efficiency  of  the  fre¬ 
quency  upehift  process,  because  a  significant  portion  of 
the  laser  beam  would  be  reflected.  High  gas  pressures 
aiao  tend  to  increase  defocusing  because  of  the  higher 
plasma  density  and  resulting  higher  index  gradients  in  the 
interaction  r^on. 

The  initial  motivation  for  this  investigation  came  from 
the  experiments  of  Wood  and  co-workers.'  *  These  experi¬ 
ments  have  shown  that  the  magnitude  of  the  blue  shift,  as 
well  as  the  shape  and  time  development  of  the  blue-shifted 
spectra,  depends  strongly  and  reproducibly  on  noble-gas 
speciM,  fpu  pressure,  and  ionizing  laser  intensity,  as  is 
discuaaed  in  Section  3.  Our  main  objective  in  thr  ,.u 
merical  simulations  is  to  examine  whether  th-  •  or 
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experimental  trends  can  be  explained  in  the  context  of  ap¬ 
propriate  theories  of  ionization  in  intense  light  fields.  To 
this  end,  special  emphasis  is  placed  on  two  points.  First, 
the  tunneling  formula  of  Ammosov  et  al.'^  is  used  to 
evaluate  field-ionization  rates,  in  contrast  to  earlier  pre¬ 
liminary  analyses*  of  the  blue  shifts  based  on  the  Drude 
model  with  Keldysh^*  field-ionization  rates.  Since  recent 
ion-yield  experiments”'*  show  field  ionization  rates  that 
are  in  discord  with  the  Keldysh  formula  and  in  better 
agreement  with  the  tunneling  formula  of  Ammosov 
et  it  is  important  that  estimates  of  the  frequency 
shifts  be  evaluated  by  using  the  latter  formula.  Second, 
the  possible  contribution  of  coUisional  ionization  to  the  ob¬ 
served  blue  shifts  is  evaluated.  Recent  studies’  of  the 
pressure,  the  intensity,  the  species,  and  the  time  depen¬ 
dences  of  the  blue  shifts  provided  evidence  that  electron- 
impact  ionization  begins  to  supplement  field  ionization  in 
the  heavy  noble  gases  at  the  highest  light  intensity 
(10>«  W'crr.’)  and  pressure  [5  atm  (3.8  kTorr)]  studied  ex¬ 
perimentally.  A  Drude  model  that  combined  field-  and 
impact-ionization  rates  yielded  qualitative  agreement 
with  the  data.’  Here,  using  a  more  rigorous  procedure  in 
which  the  wave  propagation  is  solved  consistently  with 
electron  dynamics,  we  reexamine  the  relative  significance 
of  these  ionization  processes. 

We  use  a  one-dimensional  electromagnetic  plasma  simu¬ 
lation  code  to  study  the  ionization-induced  self-frequency 
shifts  of  the  femtosecond  ionizing  laser  pulses.  A  de¬ 
scription  of  the  simulation  model  is  given  in  Section  2. 
In  Action  3  we  present  some  of  the  experimental  results 
and  discuss  the  reason  for  invoking  the  significance  of  the 
collisiona!  ionization  process  in  explaining  the  observed 
frequency  shifts.  A  more  comprehensive  presentation  of 
the  experimental  results  is  being  prepared.'*  The  nu¬ 
merical  results  are  then  presented  in  Section  4.  The  con¬ 
clusions  are  given  in  Section  5. 

2.  SIMULATION  MODEL 


where  Wj  is  the  total  ionization  rate  for  the  production  of 
charge  state  j.  We  take  both  field  ionization  and  electron- 
impact  ionization  into  account.  The  fluid  equation  for 
the  current  density  is 


The  Keldysh  formula'*  for  the  ionization  rale  is  com¬ 
monly  used  to  describe  high-intensity  laser-atom  interac¬ 
tions.  An  important  feature  of  the  Keldysh  theory  is  that 
the  ionization  rate  ia  determined  more  by  the  properties 
of  the  outgoing  electron  than  by  the  details  of  the  atomic 
structure.  The  full  form  of  the  Keldysh  formula  for  the 
ionization  rate  is 


where  y  is  the  tunneling  parameter,  given  by 


r 


[2  7,(0)]“’ 
1 - 


The  model  solves  Maxwell's  equations  for  the  fields  and 
the  continuity  equations  for  the  electron  density  and 
current.  The  pulse  lengths  are  short  enough  that  ion 
motion  is  negligible.  We  consider  a  linearly  polarized 
electromagnetic  wave  traveling  along  x;  there  are  no  vari¬ 
ations  in  y  or  z  -  k,  —  0).  The  transverse  electromag¬ 
netic  wave  components  arc  E,{x,t)  and  The 

source  wave  is  incident  from  left  to  right  on  a  gas  cell,  for 
which  we  specify  a  gas  density  uniform  along  x  for  a  given 
interaction  length  L.  The  propagation  of  the  wave  is  simu¬ 
lated  by  integration  of  Maxwell’s  equation: 

'^E  dB 

-  cV  X  B  -  4itJ,  —  -  -cV  X  E. 

at  dt 

Our  previous  simulations  show  that  for  laser  intensities 

of  10‘*-10”  W/cm’  the  dynamics  of  the  plasma  electrons  is 
dominated  by  the  transverse  motion.”  Furthermore,  for 
plasma  densities  greater  than  10'*  cm"’  the  spatial  profile 
of  the  electron  density.  n,{x,t),  is  well  constrained  by  the 
space-charge  forces.  Thus  the  electron  density  is  deter¬ 
mined  only  by  ionization,  while  the  interaction  of  the  field 
with  the  plasma  is  determined  only  by  the  transverse  cur¬ 
rent  density,  Jy(x,t}.  Let  Nj{x,t)  be  the  number  density 


E  ia  the  electric  field  stre  ,3,  u  is  the  laser  frequency, 
and  U,(.0)  ia  the  field-free  io..ization  potential.  The  other 
terms  in  the  formula  are 


The  factor  A  in  the  Keldysh  formula  is  of  order  unity  and 
accounts  for  a  weak  dependence  on  the  details  of  the  atom 
In  most  of  our  calculations  we  choose  the  tunneling  for¬ 
mula  of  Ammosov  et  al}*  to  calculate  the  field-ionization 
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ratei.  These  retee  are  much  higher  than  those  calculated 
with  the  Kel4ysh  formula.'*  Recent  ion>yield  experi- 
ments'’'*  show  that  the  Ammosov  et  cU.  formula  is  in 
better  agreement  with  data.  The  Ammosov  et  at. 
field-ionization  rate  is  given  by 


b^L* 


where  »»  is  the  atomic  unit  of  frequency  (—4.1  x 
10‘*  s'').  The  effective  quantum  number  is  obtained  by 
equating  the  ionization  potential  U,  of  the  critical  electron 
orbit  with  {Z^/nM^Vu, 


Z 

where  Uh  is  the  ionization  potential  of  H  (-13.6  eV)  and  Z 
refers  to  the  residual  charge  seen  by  this  electron.  The 
formulation  of  Ammosov  et  al.  also  used  in  the  analysis  in 
Ref.  3. 

A  number  of  empirical  formulas  for  the  electron-impact 
ionization  cross  sections  have  been  proposed.  The  for¬ 
mula  derived  by  Lotz^‘  is  widely  used  and  seems  to  be  in 
good  agreement  with  measured  data.^’  The  cross  section 
for  impact  ionizing  an  ion  in  its  electronic  ground  state  in 
charge  state  <  to  produce  a  ground-state  ion  in  charge 
state  i  -t-  1  is 


■‘““"[''"(I  ■')]}■ 


E.  is  the  electron  energy,  P^,  is  the  ionization  potential  for 
the  outer  shell,  P,t  is  the  binding  energy  for  the  first  inner 
subshell,  Pj  is  the  binding  energy  for  the  second  inner 
subshell,  etc.  is  the  number  of  electrons  in  the  jth 
shell.  Oy,  6^,  and  are  constants.  For  charge  states 
greater  than  ~3,  Lotz  uses  —  4.5  x  10'*^  cm*  and 
-  c„  -  0.  Ws  use  Oy  -  9  X  10''*  cm’  eV*  to  take  into 
account  any  effects  caused  by  collisional  ionization  of  ex¬ 
cited  state  species.  In  the  experimental  situation  the  tar¬ 
get  ions  are  bathed  in  a  light  field  as  strong  as  10**  W/cm’ 
and  are  thus  likdy  to  be  in  excited  electronic  states.  In 
an  excited  state  the  Py  valuee  become  smatter,  and  ccxise- 
quently  the  ionization  cross  section  increases.  In  fact 
Lotz*'  noted  that  the  same  formula  could  be  used  to  calcu¬ 
late  the  enhanced  cross  section  for  a  psrticular  excited 
state.  Nevertheless,  the  excited-state  distribution  is  dif¬ 
ficult  to  estimate.  Furthermore,  measurements  of  the 
impact-ionization  cross  section  of  electrosucaily  excited 
species  are  lacking.  Consequently,  for  the  ressainder  of 
this  paper,  we  simply  use  twk«i  ths  standard  ground-etate 
cross  ssctions.  Ilus  spproximaUfln  slamld  yield  ths  cor¬ 
rect  qualitative  contribution  of  collisional  ionixatioa  to 
blus  shifting  but  may  undsrestimats  its  magnitude  to 
some  extent. 

The  source  laser  is  represented  by  pisne  polsrized-wave 
solutions  to  Maxwell's  equations  in  vacuum.  A  Gaussian 
temporal  pulse  profile  is  assumed  for  the  laser  intensity 
The  form  of  the  fields  incident  upon  the  gas  is  thus 


—  fl,  —  8in(a»ot)exp(-t’/(^’), 


where  W)  is  the  frequency  of  the  source  laser  and  is 
0.6  times  the  pulse  length  (to  give  a  Gaussian  FWHM 
equal  to  the  specified  pulse  length). 

The  numerical  solution  of  Max  well's  equation  is  tmpie- 
msnted  in  one  dimension  along  the  propagation  direction 
X  by  using  the  advective  differencing  algorithm.” 
Ths  transverse  fields  are  separated  into  left-  and  right- 
fouig  coasponents.  By  choosing  Ax  —  cUt,  these  coaipo- 
nsnta  are  advanced  in  time  simply  by  shifting  the  valuea 
ovar  oos  cell.  Let  £,*  and  be  ths  right-  and  left-goinf 
coasponents,  respectively.  These  field  components  are  ad¬ 
vanced  in  time  aa 


E*{t  -t-  At,x  Ax)  —  E,\t,x)  -  wAt 

t( 


.4*f‘) 


£/(t  At, x)  —  -  Ax) 

-  n-Af 


.1 


, ,  At 


The  transmitted  and  reflected  fields  are  defined  as 
E,"{t,x  —  L)  and  E,‘'{t,x  -  0),  respectively.  The  bound¬ 
ary  values  for  the  incident  fields  are 


E,^{t,x  •  0)  “  Eo  sin(wot)exp<-tV/pu— 'J, 
£/(t,x  -  L)  -  0. 


The  electron  current  is  advanced  in  time  as 


At  — n,(t,x)[ £',''(t, I)]. 
m 


The  electron  density  is  obtained  from  the  ion  charge  state 
distribution,  which  evolves  according  to  a  simple  Euler 
scheme.  Tests  were  made  for  the  stability  of  the  ion 
charge  state  evolution  by  using  more  robust,  but  more 
time  consuming,  ordinary  differential  equation  solvers. 

Several  checks  of  the  Maxwell  solution  were  performed 
by  using  an  incident  laser  pulse  with  a  Gaussian  temporal 
profile.  When  the  temporal  derivative  of  the  refractive 
index  is  constant  for  the  duration  of  the  pulse,  the  result¬ 
ing  profile  will  be  a  Gaussian  of  the  same  width  as  the 
original  but  centered  at  to  —  oto  Aw.  The  entire  spec¬ 
trum  will  have  shifted  by  the  amount  Aw.  As  a  test  case 
we  aasume  that  the  ionization  is  produced  externally,  in- 
dspsndently  of  the  laser  pulse,  is  uniform  over  the  interac¬ 
tion  length,  and  has  a  constant  temporal  derivative. 
Accerdiag  to  the  Drude  model,  for  an  incident  wavelength 
A»  ud  an  interaction  length  L  the  shift  is  given  by 

A  A  »  dn, 

2nm,c*  dt 

For  an  incident  laser  wavelength  of  620  nm  and  an  inter¬ 
action  length  of  10  /tm  an  ionization  rate  of  2.805  x 
10”  cm"*  8"'  wiU  result  in  a  spectral  blue  shift  of  10  nm 
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Fig.  1.  Calculated  spectra  for  a  lOO-fs  laser  pulse  with  an  origi¬ 
nal  wavelength  of  620  nm.  The  laser  is  propagated  through  a 
spatially  uniform  plasma  with  an  interaction  length  of  10  fim  and 
an  ionization  rate  of  2.805  x  10"  cm”’  s"'.  The  dotted  curve 
is  the  incident  spectrum:  the  solid  curve  is  the  transmitted 
spectrum. 


Wavelength  (nm) 

Fig.  2.  Measured  spectra  after  shifting  through  5  atm  (3.8  kTorr) 
of  He  and  Ar  gases  with  a  peak  laser  intensity  of  lO"  W/cm’. 

Figure  1  shows  the  transmitted  pulse  spectrum  as  ob¬ 
tained  from  the  Maxwell  solver  for  this  test  case.  Since 
there  is  no  spatial  variation  within  the  plasma,  the  solu¬ 
tion  agrees  with  the  Drude  model,  as  expected. 

3.  EXPERIMENTAL  RESULTS 

In  this  section  we  present  some  of  the  experimental  re¬ 
sults  and  discuss  the  reason  for  invoking  the  significance 
of  the  colHsional  ionization  process  in  explaining  the  ob¬ 
served  frequency  shifts. 

In  the  experiments  of  Wood  and  Downer lOO-fs 
laser  pulseo  with  center  wavelengths  of  620  nm  and  ener¬ 
gies  less  than  or  equal  to  0.4  mJ  are  focused  at  f/5  into  a 
glass  cell  containing  a  pressure  of  between  1  and  5  atm 
(0.76  and  3.8  kTorr)  of  He,  Ne,  Ar,  Kr,  or  Xe  gases.  The 
minimal  focal  spot  size  is  measured  as  7  ^im  in  diameter 
in  order  for  the  intensity  to  reach  1/e  of  the  central  inten¬ 
sity  maximum.  Light  transmitted  through  the  focal  re¬ 
gion  is  collected  and  analyzed  by  a  apectrometer.  The 


spectra  of  the  laser  pulses  are  recorded  for  varying  gases, 
pressures,  and  pulse  energies.  Typically  less  than  1%  of 
the  pulse  energy  is  lost  in  ionizing  the  gas. 

Figure  2  shows  the  originsJ  pulse  spectrum  and  the 
measured  spectra  siter  shifting  through  6  atm  of  He  and 
Ar  gases  with  a  nominal  peak  intensity  of  10"  W/cm*.  In 
He  a  significant  amount  of  the  pulse  energy  remains  un¬ 
shifted,  indicating  that  the  derivative  of  the  index  of  re¬ 
fraction  or,  equivalently,  the  ionization  rate,  is  zero  during 
a  significant  portion  of  the  laser  pulse.  In  Ar  almost  the 
entire  pulse  spectrum  is  shifted  away  from  the  original 
spectrum,  indicating  a  nonzero  value  for  the  derivative  of 
the  index  or,  equivalently,  for  the  ionization  rate,  during 
the  entire  pulse.  The  same  qualitative  feature  is  ob¬ 
served  in  Kr  and  Xe  gasea. 

Figure  3  shows  the  effect  of  varying  the  laser  pulse  en¬ 
ergy  on  the  measured  spectrum  after  breakdown  in  Kr 
gas  at  6  atm  of  presstire.  The  center  of  the  unshifted 
pulse  spectrum  is  indicated  by  a  vertical  line.  As  the 
pulse  energy  is  increased,  the  spectrum  peak  moves  to¬ 
ward  the  blue  until  the  pulse  energy  reaches  0.01  mJ 
[-log(Energy)  —  5.0],  after  which  the  position  of  the  peak 
appears  to  change  little.  As  the  energy  increases  further, 
a  shoulder  appears  on  the  blue  side  of  the  spectrum,  which 
becomes  broiuler  and  shifts  farther  toward  the  blue. 

Figure  4  shows  the  measured  spectra  after  interaction 
with  each  of  the  noble  gases  at  5  atm  of  pressure  for  a 
pulse  energy  of  0.26  mJ.  A  blue  shift  ocpirs  in  all  cases. 
However,  the  shape  of  the  blue-shifted  spectrum  depends 
strongly  on  the  gas  species.  In  Ar,  Kr,  and  Xe  the  nar¬ 
row,  less-shifted  peak  and  the  broad,  blue  shoulder  are 
clearfy  discernible,  and  the  shoulder  appears  as  a  separate 
peak.  The  blue  shoulder  shows  greater  shifting  as  the 


Fig.  3.  Measured  spectra  after  interaction  with  5  ettn  of  Kr  as 
a  function  of  pulse  energy.  Pulse  energy  is  increasing  toward  the 
bottom  of  the  figure  in  steps  of  The  center  of  the  origi¬ 

nal  spectrum  is  indicated  by  a  vertical  line  at  620  nm. 


Fig.  4.  Meastured  spectra  after  interaction  of  0  2 5- raJ  pulses  m  5 
atm  of  aach  of  the  noble  gasea  studied.  The  center  of  original 
pulse  spectrum  is  indicated  by  a  vertical  dashed  line  at  620  nm 
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Fig.  5.  Measured  spectra  after  interaction  of  0.2&-mJ  puJaes 
with  Ar  gas  as  a  function  of  gas  praaaura.  Praaaure  is  increasing 
toward  the  bottom  of  tha  figure.  The  eantar  of  original  pulse 
spectrum  ia  indicated  bgr  a  vertical  dashed  line  at  620  nm. 

atomic  weight  of  the  gas  ia  increased,  corresponding  to  the 
plasma  densities  that  are  attained  in  the  heavier  gases. 
At  the  same  time  the  height  of  the  narrow,  less-shifted 
peak  decreases.  In  He  and  Ne,  in  contrast,  the  blue 
shoulder  is  absent.  Instead  a  residual  shoulder  appears, 
corresponding  approximately  to  the  unshifted  spectrum. 

The  effect  of  gas  pressure  on  the  measured  spectra  in 
Ar  for  a  pulse  energy  of  0.25  mJ  is  shown  in  Fig.  5.  At  all 
pressures  shown  there  is  the  narrow  peak  that  is  blue 
shifted  by  6  nm.  Only  for  preasurea  above  1  atm  does  the 
broad,  more-blue-shifted  shoulder  appear  to  be  discern¬ 
ible.  The  blue  shoulder  shows  greater  shifting,  and  the 
height  of  the  narrow,  less-shifted  peak  decreases  as  the 
gas  pressure  ia  increased.  Also,  the  breadth  of  the  broad 
blue  shoulder  increases  as  the  gas  pressure  ia  increased. 
Note  the  absence  of  a  residual  shoulder  even  at  the  lower 
pressures  of  Ar. 

Although  the  amplitude  and  the  width  of  these  spectral 
features  depend  on  the  focal  profile,  chirp,  and  other  de¬ 
tails  of  the  ionizing  pulse,  numerous  measurements  con¬ 
firm  that  the  qualitative  trends  shown  in  Figs.  2-6  are 
reproducible  features  of  femtosecond  ionization  of  noble 
gases.  Time-resolved  pump-probe  experiments  were  also 
performed.’’^  These  experiments  show  that  (in  Ar,  Kr, 
and  Xe)  the  broad  blue  shoulder  evolves  well  in  the  leading 
edge  of  the  ionizing  pulse,  while  the  less-shifted  peak 
arises  at  or  slightly  after  the  peak  of  the  ionizing  pulse. 
Furthermore,  the  single  blue-shifted  peak  in  He  and  Ne 
(like  the  broad  blue  shoulder  in  Ar,  Kr,  and  Xe)  evolves  in 
the  leading  edge. 

In  calculating  the  plasma  density,  using  either  Keldysh 
or  Ammosov  et  al.  strong-field  ionization  but  without  col- 
lisional  ionization,  it  is  observed  that  at  all  intensities,  in 
each  gas,  the  ionization  has  saturated  by  the  peak  of  the 
laser  pulse,  or  within  10-20  fs  after  the  peak  of  the  pulse 
(see  the  dashed  curves.  Fig.  6).  This  restilt  implies  that 
in  all  gases  a  significant  portion  of  the  ionizing  pulse  en¬ 
ergy  would  remain  unahifted,  as  ia  observed  in  He.  Be¬ 
cause  the  data  for  Ar,  Kr,  and  Xe  do  not  exhibit  this  type 
of  behavior,  there  ia  the  implication  that  significant  ioniza¬ 
tion  occurs  during  and  for  some  time  after  the  peak  of  the 
laser  pulse.  This  additional  ionization  may  be  caused  hy 
electron-impact  collisions  with  the  gas.  T^  initial  inter¬ 
pretation  that  comes  from  the  Drude  model  calculations*^ 
is  the  following;  Strong-field  ionization  before  the  peak 
of  the  laser  pulse  gives  rise  to  the  broad  blue  shoulder; 


Peneirante  tt  al 

collisional  ionization  after  the  peak  of  the  laser  pulse  gives 
rise  to  the  narrow,  less-blue- shifted  peak.  In  Section  4 
we  present  numerical  simulations  to  investigate  whether 
this  preliminary  interpretation  of  the  experimental  dau 
is  correct. 

4.  CALCULATIONS  OF 
IONIZATION-INDUCED  EFFECTS 

Figure  6(a)  shows  the  calculated  evolution  of  the  degree  of 
ionization  in  5  atm  of  the  noble  gases  illuminated  by  a 
100-fs  laser  pulse  with  a  peak  intensity  of  10“  W/cm*,  with 
the  Keldysh  rates  used  for  strong-field  ionization.  The 
solid  curves  are  calculations  with  collisional  ionization  ac¬ 
cording  to  the  Lots  formula,  while  the  dashed  curves  are 
calculations  without  collisional  ionization.  Figure  6(b) 
shows  the  corresponding  calculations  made  using  the 
Ammosov  et  al.  rates  for  strong-field  ionization  Note 
that,  except  for  He,  during  the  peak  of  the  laser  pulse 
there  is  a  distinct  difference  in  the  degrees  of  ionization 
calculated  with  and  without  collisional  ionization.  This 
difference  is  quite  dramatic  when  the  Keldysh  rates  are 
used  for  the  strong-field  ionization.  Although  these 
differences  are  less  significant  when  the  Ammosov 
et  al.  rates  are  used,  they  are  still  apparent.  Also,  note 
that  the  degrees  of  ionization  obtamed  with  the  Ammosov 

(a)  using  Keldysh  rates 


Pig.  6.  Evolution  of  the  degree  of  ionization  in  5  atm  of  the  noble 
gases  with  a  peak  laser  intensity  of  10“  W/cm',  calculated  using 
(a)  Ksktyah  and  (b)  Ammoeov  et  al.  rates  for  itrong-field  ioniza¬ 
tion.  The  solid  curvet  are  csdculations  tliat  include  collisional 
ionization;  the  dashed  curves  are  without  collisional  ionization 
The  time  profile  of  the  laser  pulse  is  shown  by  the  dotted  curv>>!; 
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Fig.  7.  Spectra  for  ionization  of  5  atm  of  Xe  by  JOO-fa  pulae 
of  peak  intensity  10'*  W/cm’,  calculated  using  Keldysh  and 
Ammosov  et  al.  rates  for  strong-field  ionization.  The  laser- 
plasma  interaction  length  is  10  The  dotted  curves  shows 
the  original  spectra;  the  solid  curves  are  the  shifted  spectra  in¬ 
cluding  coilisionaJ  ionization;  the  dashed  curves  are  without  coUi- 
sional  ionization. 

et  al.  rates  are  much  larger  than  those  obtained  with  the 
Keldysh  rates. 

The  magnitude  of  the  differences  in  the  ionization 
degree  is  reflected  in  the  calculated  spectral  shifts. 
Figure  7  shows  the  spectra  in  5  atm  of  Xe  calculated  by 
using  the  Keldysh  rates,  compared  with  that  for  the 
Ammosov  et  al.  rates.  The  laser-plasma  interaction 
length  is  assumed  to  be  10  in  these  calculations. 
With  the  Keldysh  rates  there  is  a  dramatic  difference  in 
the  spectral  shifts  with  and  without  coUisional  ionization, 
whereas  this  difference  is  quite  small  when  the  Ammosov 
et  al.  rates  are  used  to  calculate  the  field  ionization.  The 
steeper  rate  of  increase  in  the  degrees  of  ionization  ob¬ 
tained  with  the  Ammosov  et  al.  rates  corresponds  to  much 
larger  frequency  blue  shifts  in  the  transmitted  spectrum, 
as  one  would  expect  from  the  simple  Drude  model.  Fur¬ 
thermore,  only  the  Ammosov  et  al.  formulation  correctly 
yields  the  double-peaked  spectral  shape  observed  experi¬ 
mentally  (see  Fig.  4)  w'.th  5  atm  of  Xe,  confirming  that  it 
provides  the  more  accurate  description  of  ionization  under 
the  experimental  conditions. 

The  results  shown  in  Figs.  6  and  7  suggest  that  the  sig¬ 
nificance  of  coUisional  ionization  depends  strongly  on  the 
values  of  the  field  ionization  rate.  Recent  experiments’’"'* 
shov/  field  ionization  rates  that  are  in  better  agreement 
with  the  tunneling  formula  of  Ammosov  et  al.  '^  For  the 
remainder  of  this  paper  we  use  the  Ammosov  et  al.  rates 
to  calculate  the  frequency  shifts. 
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Figure  8  shows  the  calculated  spectrum  in  5  atm  of  the 
noble  gases.  The  effect  of  coUisional  ionization  is  negli¬ 
gible  in  the  case  of  He,  as  is  evident  also  from  Fig.  6(b),  in 
agreement  with  earlier  analyses.*'*  For  Ne  the  effect  of 
coUisional  ionization  is  smaU  but  distinct.  There  is  little 
contribution  from  coUisional  ionization  during  the  second 
half  of  the  laser  pulse.  CoUisional  ionization  does  not  sig¬ 
nificantly  alter  the  shape  of  the  spectrum  but  merely 
shifts  the  entire  spectrum  a  little  bit  toward  the  blue. 
The  spectrum  in  Ne  shows  a  prominent  blue-shifted  peak 
and  a  smaU  residual  peak,  as  is  observed  experimentaUy 
(see  Fig.  4).  For  the  heavier  gases  Ar,  Kr,  and  Xe  the 
frequency  shifting  manifests  itself  in  two  ways.  First, 
there  is  the  appearance  of  a  broad,  blue  spectral  shoulder 
that  is  shifted  from  the  original  by  as  much  as  35  nm  in 
Kr  for  the  parameters  used  in  Fig.  8.  TemporaUy  this 
feature  arises  mainly  from  the  lower  ionization  stages,  oc- 
curing  weU  within  the  leading  edge  of  the  ionizing  pulse, 
in  agreement  with  the  time-resolved  experiments." 
This  feature  is  unaffected  by  including  collisions  in  the 
model  and  thus  originates  purely  from  strong-field  ioniza¬ 
tion.  Second,  the  extra  blue  shifting  resulting  from  the 
higher  ionization  degrees  attained  in  the  heavier  gases 
sometimes  causes  the  residual  peak  to  be  blue  shifted  even 
in  the  absence  of  coUisions.  Compare,  for  example,  the 


Wavelength  (nm) 

Fig.  8.  Spectra  for  ionization  of  6  atm  of  He,  Ne,  Ar,  and  Kr  by 
a  100-fa  pulae  of  peak  intensity  10'*  W/ctn*,  calculated  with  the 
Ammosov  et  al.  rates  used  for  field  ionization.  The  laser-plasma 
interaction  length  is  10  4m.  The  dotted  curves  shows  the  origi¬ 
nal  spectra;  the  solid  curves  are  the  shifted  spectra  including  col- 
iisional  ionization;  the  dashed  curves  are  without  coUisional 
ionization. 
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Fig.  9.  Calculatad  evolution  of  the  degroo  of  ioniMtioa  in  S  atm 
of  Ar  for  various  p«ak  laser  intensities.  The  time  profile  of  the 
laser  pulse  is  shown  by  the  dotted  curve. 


spectra  of  Xe  and  Kr  (dashed  curves)  in  Fig.  8.  Since  the 
highest  ionization  stage  in  Xe  occurs  at  a  slower  rate  than 
the  lower  stages  [Fig.  6(b}],  it  causes  the  blue  shift  of  the 
tall,  narrow,  lesa-shifted  peak  rather  than  a  further  shift 
in  the  broad,  blue  shoulder.  Furthermore,  Fig.  6(b)  shows 
that  this  final  stage  (and  thus  the  blue  shifting  of  the  less- 
shifted  peak)  should  ocoir  near  the  peak  of  the  ionizing 
pulse,  in  qualitative  agreement  with  time-resolved  experi¬ 
ments.^  Thus  femtosecond  ionization  of  5-atm  Xe  at 
10”  Wcm’  light  intensity  provides  a  specific  example  in 
which  the  main  qualitative  features  of  the  data*  can  be 
explained  without  invoking  coUisional  ionization  at  all. 
Elucidation  of  such  an  example  is  a  major  new  point 
brought  out  by  the  current  simulation,  udiich  was  not  evi¬ 
dent  in  earlier  analyses*^  of  the  bhie-shift  data. 

Nevertheless,  in  other  cases  coUisional  ionization  may 
stiU  be  necessary  to  account  for  the  main  features  of  the 
data.  For  example,  without  collisions  in  the  model  the  Ar 
and  Kr  spectra  (Fig.  8,  dashed  ciu^ee)  each  show  a  promi¬ 
nent  unshifted  spectral  peak,  in  contradiction  to  the  data 
(Fig.  4).  With  coUisional  ionization  included,  on  the  other 
hand,  this  residual  peak  is  blue  shifted.  Furthermore,  its 
extra  shift  occurs  temporaUy  near  the  peak  of  the  ionizing 
pulse  [see  the  soUd  curves  in  Fig.  6(b)],  in  qualitative 
ag  reement  with  time-resolved  experiments.*  Even  in  the 
case  of  Xe,  coUisional  ionization  supplmnents  the  effect 
described  above  by  producing  an  extra  shift  of  the  less- 
blue-shifted  feature  near  the  temporal  peak  of  the  ioniz¬ 
ing  pulse.  The  calculated  collision-induced  shift, 
however,  is  smaller  (1-2  nm)  in  magnitude  than  the  shift 
(5  nm)  observed  experimentally.  The  assumption  of 
twice  the  ground-state  impact  ionization  cross  section 
may  be  responsible  for  this  discrepancy  by  underestimat¬ 
ing  the  impact  ionization  rate  of  electronically  excited  spe¬ 
cies.  Conceivably  a  high  degree  of  electronic  excitation 
could  reaiilt  in  an  even  higher  croea  section.**  A  detailed 
discussion  of  this  interpretation  is  in  preparation.** 

The  calculated  evolution  of  the  degree  of  ionization  in 
Ar  is  shown  in  Fig.  9  for  variotu  peak  laser  intensities. 
The  corresponding  spectra  are  shown  in  Fig.  10.  At  a 
laser  intensity  of  5  x  10”  Wcm*,  Ar  is  only  singly  ionized. 
At  a  pressure  of  5  atm  and  an  interaction  length  of  10  nm, 
this  calculation  produces  a  blue-shifted  peak  that  is  4  nm 
from  the  original.  A  smaU  residual  shoulder  correspond¬ 


ing  to  the  far-red  wing  of  the  unshifted  spectrum  is  evi¬ 
dent.  At  a  laser  intensity  of  10“  W/cm*,  Ar  is  doubly  ion¬ 
ized.  This  produces  a  main  blue  shift  of  7  nm.  Further¬ 
more,  the  residual  shoulder  is  now  centered  closer  to  the 
center  of  the  unshifted  spectrum.  At  a  laser  intensity  of 
5  X  10“  W/cm*,  Ar  is  more  than  four  times  ionized.  Al¬ 
most  the  entire  spectrum  is  shifted  away  from  the  origi¬ 
nal  spectrum.  The  spectrum  becomes  broader  and  is 
shifted  farther  toward  the  blue.  Concurrently  relatively 
little  unshifted  Light  remains.  This  produces  a  new  main 
peak,  which  is  blue  shifted  by  —4  nm;  and  a  brood,  blue 
shoulder  that  is  bltte  shifted  by  14  nm.  At  a  laser  inten¬ 
sity  of  10“  W/cm*,  ai^proximately  half  of  this  main  peak  is 
blue  shifted  farther,  causing  a  subdivision  into  two  peaks. 
The  other  half  is  once  again  centered  on  the  original  pulse 
spectrum.  The  broad,  Uue  shoulder  is  also  shifted  far¬ 
ther,  giving  the  appearance  of  a  third  peak. 

The  effect  of  gas  pressure  on  the  calculated  spectra  in 
Ar  is  shown  in  Fig.  11  for  a  peak  laser  intensity  of 
10“  W/cm*.  As  the  pressure  is  increased  to  2  atm.  only 
one  peak  is  prominent,  and  the  blue  shift  is  simply  propor¬ 
tional  to  the  pressure.  As  the  pressure  is  increased  to 
3  atm  this  peak  begins  to  subdivide,  giving  the  appearance 
of  a  plateau.  At  a  pressure  of  4  atm  the  broad,  blue  shoul¬ 
der  becomes  evident,  as  well  as  the  narrow,  less-blue- 
shifted  peak. 

The  above  calculations  show  additional  examples  of  con¬ 
ditions  under  which  the  simulation  without  collisions  can 
reproduce  the  main  qualitative  features  of  the  data.  Most 
noteworthy  is  the  case  of  5-atm  Ar  at  5  x  10“  W/cm* 
(Fig.  10).  In  this  case  the  simulation  yields  a  broad  blue 
shoulder  and  a  less-shifted  peak,  with  little  residual  un- 
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Fig.  10.  Calculated  spectra  in  5  atm  of  Ar  for  various  peak  laser 
intensities.  The  dotted  curve  shows  the  original  spectrum 
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Fig.  11.  Calculated  spectra  for  ionization  of  Ar  at  various  pros' 
aures  with  a  peak  laser  intensity  of  10‘*  W/cm*.  The  dotted  curve 
shows  the  original  spectrum. 

shifted  energy,  quite  like  the  corresponding  measured 
self-blue-shifted  spectra  (Figs.  4  and  5).  Furthermore, 
the  calculated  temporal  behavior  qualitatively  agrees  with 
time-resolved  experiments.^  In  other  cases,  however,  the 
simulated  blue  shift  disagrees  significantly  with  the  ex¬ 
periment.  For  example,  as  intensity  increases  to 
10'*  W/cm*  in  5-atm  Ar,  the  reappearance  of  a  large  frac¬ 
tion  of  unshifted  energy  (caused  by  completion  of  sixfold 
ionization  in  the  leading  edge  of  the  pulse)  is  a  trend  not 
observed  experimentally.  On  the  other  hand,  coUiaiona) 
ionization  at  a  sufficient  rate  might  maintain  the  blue 
shifting  of  the  entire  pulse  spectrum  as  intensity 
increases. 

A  more  definitive  comparison  between  experiment  and 
simulation  canriot  be  drawn  at  this  time  for  two  reasons. 
First,  the  experimental  peak  intensity  at  the  focus  is  diffi¬ 
cult  to  measure.  Our  estimates  are  only  accurate  to 
within  a  factor  of  2  or  3.  Second,  the  simulation  is  1-D 
and  thus  does  not  take  into  account  either  the  spatial  av¬ 
eraging  along  the  radius  of  the  laser-plasma  interaction 
region  or  the  defocusing  of  the  laser  pulse  at  higher  gas 
pressures.  To  include  such  effects  would  require  a  two- 
dimensional  calculation.  Nevertheless,  even  the  1-D 
simulation  successfully  reproduces  most  of  the  features  of 
the  data.  Uncertainty  remains  in  the  interpretation  of 
some  details,  notably  the  importance  of  collisional  ioniza¬ 
tion  in  producing  a  secondary  blue  shift  near  the  temporal 
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peak  of  the  ionizing  pulse  in  several  atmospheres  of  the 
heavier  rare  gases. 

5.  CONCLUSIONS 

The  technique  of  spectral  blue  shifting  of  intense  fem¬ 
tosecond  lam  pulsM  p.'ovides  a  powerful  diagnostic  of 
plasma  evolution  at  atmospheric  densities.  Both  the  in¬ 
tensity  and  the  short  rise  times  of  these  laser  pulses  lead 
to  high  rates  of  plasma  formation,  which  in  turn  lead  to 
measurable  blue  shifts  in  the  transmitted  spectra.  Such 
experimenta  aignincantly  complement  standard  diagnos- 
tica  of  atrong-field  ionization,  such  as  ion  yield  and  pho¬ 
toelectron  speetroecopy,  by  providing  femtosecond  time 
reeohition  a^  bry  permitting  experiments  in  much  denser 
plasmas.  Using  a  more  rigorous  procedure  than  previous 
analyses,*^  the  1-D  electromegnetic  plasma  simulations 
presented  here  provide  qualitative  explanations  of  most  of 
the  feeturee  observed  experimentally  in  the  blue-shifted 
spectra. 

The  magnitude  of  the  spectral  shifts  and  the  relative 
importance  of  collisional  ionization  depend  strongly  on 
both  the  field-ionization  and  colHsional-ionization  rates 
used  in  the  simulations.  Our  simulations  lead  to 
two  main  conchuiona  that  were  not  evident  in  previous 
analyBea.*^  First,  the  contribution  of  collisional  ioniza¬ 
tion  to  the  evolution  of  the  plasma  density  is  almost  negli¬ 
gible  if  we  use  the  Ammosov  et  al.  rates  for  field 
ionization  and  the  Lotz  rates  for  electron-impact  ioniza¬ 
tion  of  electronic  ground-state  species.  If.  on  the  other 
hand,  the  impact  ionization  cross  sections  are  enhanced 
(by  >2  times)  because  of  electronic  excitation  of  the  target 
ions,  collisional  ionization  would  then  contribute  substan¬ 
tially  in  a  manner  consistent  with  the  data.  Second,  at 
some  excitation  levels  in  many  of  the  gases  the  1-D  simu¬ 
lation  explains  the  main  qualitative  features  of  the  data 
without  invoking  a  collisional  ionization  mechanism. 
However,  some  cases  remain  (high  intensit'-  and  pressure 
in  some  of  the  heavy  rare  gases)  for  which  Gaining  the 
data  may  still  require  a  ctdlisional  mecha:  n.  Further 
studies  will  be  needed  before  the  role  of  ct  ional  ioniza¬ 
tion  can  be  established  or  ruled  out  definitely. 

For  more  detailed  comparison  with  experiment,  a  two- 
dimensional  calculation  is  necessary  if  effects  such  as 
beam  defocusing  are  to  be  included. 
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OPTICAL  INTERACTIONS  WITH 
CONDENSED  MATTER  AND  j 
ULTRAFAST  PHENOMENA 


QThDI  Temperature  dependence 
of  the  refractive  indices 
and  optical  band  gap  of 
thin  amorphous  silicon 
films 

X.  Do,  P.  T.  Leung,  L,  Klees,  A.  C.  Tam,  F. 
Tong,  W.  P.  Leung 

'.-M  Ri’M\>rch  Diiiiioi,  Almndii:  Research 
Ci  ■::a,  i5u  Hrirr  j  RMd,  San  ]ose  California 

'.a.'ijtion  oi  the  optical  constants  with 
:empt'rniure  for  amorphous  sibcon  (a-Si),  as 
■•MM  as  other  solidi,  is  important  in  pho- 
tv'-herm.al  processing,  such  as  laser  anneal- 

i.ng  Although  crystal  silicon  and  other 
semiconductor  materials  have  been  studied 
extensively,  a-St  is  less  studied  in  this  aspect. 
Previous  studies  have  reported  that  there  is 
insignificant  temperature  dependence  of  the 
.ihxorption  coefficient  at  the  NdiYAG  laser 
eiength  CL06  for  a-Si  samples  of 

-..•-'micron  thickness.  This  has  then  been 
metimes  assumed  to  be  a  general  property 
j-Si  in  studies  insolving  laser  pulse  inter- 
with  the  material,^ 

In  this  work,  we  present  a  thorough  study 
- 1  the  dependence  of  the  real  and  imaginary 
reiractive  index  n(7T  and  k(T)  as  functions  of 
temperature  T  for  a-Si.  Such  dependence  is 
sensitive  to  sample  thickness  and  optical 
wavelength. 

To  determine  the  optical  constants  for  the 
samples,  we  have  followed  the  standard  re- 
•Xctance  (R)  and  transmittance  (D  measure- 
m.t  r.is  in  which  R  and  I  are  measured  at 
;.,:;ii"-t  nor.mal  incidence  and  m.T)  and  HT)  at 
.i  :  ■  ,  J  temperature  are  obtained  from  an  it- 
■.  i.'itii  e  computer  program.  The  samples  we 
rtudv  ha\e  the  thicknesses  of  0.2  pm  and  1.0 
..m  The  two  different  incident  wavelengths 
.ue  t].7S2  pm  and  1.15  pm.  The  results  are 
sho  wn  in  Fig.  1  and  2  from  which  we  observe 
the  almost  linear  rise  of  tifTl  with  tempera- 
ru.'e  which  is  comparable  to  that  with  crystal 
-ilicon  (c-Si).  However,  k(T)  varies  quite  dif- 
ic-entlv  and,  in  general,  increases  at  a  slower 
r.,;e  For  the  0.2-pm  thickness  at  l.lS-pm 
.  v.  viength,  Jk'dT  is  indeed  negligible;  but 
:■  :  lor  a  case  with  more  absorptive  wave- 
■  ;  cth  or  greater  sample  thickness.  Thus,  we 
•.dude  that,  contrary  to  previous  expecta- 
;  .  n.s.  ’  the  absorption  coefficient  of  a-Si  can 
r.wtf  a  relatively  strong  temperature  depen¬ 
dence  in  the  highly  absorptive  regime,  and 
n-.’jst  be  co.nsidered  in  laser  processing  of  a-Si. 


In  addition,  by  applying  the  Mott-Davis 
formula  to  the  data  obtained  from  n(T)  and 
k(T),  results  are  obtained  for  the  temperature 
dependence  of  the  optical  gap  energy  tor 
a-Si.  Here,  while  £g  for  a-Si  is,  in  general, 
larger  than  that  for  c-Si,  they  have  similar 
temperature  variation  and  can  all  be  fitted 
using  the  well-known  empirical  formula,^ 
Results  for  Eg(T)  are  shown  in  Fig.  3- 
*iBM  Wijfson  Research  Center,  Yorktown 
Heights,  New  York  10593;  *IBfA  Storage 
System  Product  Division,  5600  Cottle  Road, 
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QTh02  Rapid-scan  femtosecond 
ellipsometry^f  Sii-xGex 
and  Ceo  crystaline  films 

H.  l^Choo,  M.  C.  Downer,  M  P.  Kesan 
Phyiics  Department,  Univer^y  of  Texas, 
Austih, Texas  78712 

Recent  stadrw-Hfave  shown  that  the 
femtosecond  optical  response  of  semicon¬ 
ductors  depends  sensitively  on,  and  can  pro¬ 
vide  a  quantitative  indicator  of,  key  sample 
imperfections  such  as  implantation  damage,^ 
defects  induced  by  low  growth  temperature,' 
nonuniform  doping  profiles,'  and  interface 
roughness.'  In  this  paper,  we  use  an  unampli¬ 
fied  colliding-pulse  mode-locked  (CPM) 
laser  (A,  =  620  nm)  to  obtain  the  femtosecond 
ellipsometric  response  in  air  of  Sii.^Ge,  and 
crystalline  films,  for  which,  to  our  knowl¬ 
edge,  no  previous  femtosecond  studies  exist. 
We  show  that  femtosecond  response  of  Si], 
,Ge,  grown  epitaxially  on  Si  depends  sensi¬ 
tively  on  the  depth  of  the  strained  SiCe/Si 
interface  beneath  the  surface,  while  the  re¬ 
sponse  of  films  depends  on  the  degree  of 
crystallinity  as  determined  independently  by 
x-ray  diffraction.  In  each  case,  the 
femtosecond  response  of  the  samples  of  high¬ 
est  crystalline  quality  reveals  the  basic  non¬ 
equilibrium  bulk  earner  dynamics,  whereas 
the  response  of  less  perfect  samples  reveals 
alterations  in  carrierdynamicsattributable  to 
strained  interfaces,  alloy  separation,  or 
amorphization.  A  "rapid  scan"^  of  pump- 
probe  time  delay  At  provides  rapid  data  ac¬ 
quisition,  The  samples  are  pumped  at  normal 
incidence  and  probed  either  at  near-normal 
incidence  or  at  oblique  incidence  (Q  -■  70') 
through  an  ellipsometric  optical  sequence 
(polarizer,  compensator,  sample,  analyzer, 
differential  detector)  to  provide  cumplete  op¬ 
tical  characterization  of  film  and  substrate  at 
each  At. 


Figure  1(a)  shows  the  femtosecond  reflec¬ 
tivity  response  of  a  complete  compositional 
family  of  optically  thick  Si,.,Get  films  im- 
mediateh-  following  growth  by  molecular 
beam  epuaxv  (.MBE;  on  Si  substrates.  For 
these  samples,  the  strained  SiGe/Si  interface 
lies  well  below  the  optical  probe  depth.  Thus, 
the  carrier  dynamics  probed  are  those  or  re¬ 
laxed  Sii.jGe,,  and  are  understandable  m 
terms  of  the  bulk  band  structure.  Spedfically, 
with  increasing  x,  the  amplitude  of  the  initial 
response  increases  with  increasing  pump  ab¬ 
sorption  coefficient,  and  thus  increasing  ini¬ 
tial  earner  density.  In  addition,  the 
one-component  temporal  relaxation  for  Si- 
like  films  lx  <  0.6)  changes  to  a  two-compo¬ 
nent  response  for  Ge-hke  (.v  2  0.6)  films, 
because  of  a  change  in  carrier  dynamics  re¬ 
sulting  from  different  initial  k-space  distribu¬ 
tions.  In  Ge-like  samples,  direct  L  -*  L  valley 
pump  absorption  creates  sufficient  excess 
hole  energy  to  allow  delayed  secondary  car¬ 
rier  generation  by  impact  ionization  which, 
together  with  interealley  /.  — >  F  hole  scatte  r¬ 
ing,  produces  the  observ  ed  delayed  second¬ 
ary  reflectivity  decrease.  As  the  epitaxial  film 
becomes  thinner  than  an  optical  probe  depth, 
however,  carrier  dynamics  within  the 
strained  interlace  region  increasingly  domi¬ 
nate  the  observed  probe  response  In  these 
cases,  ive  observe  sharply  reduced  carrier 
lifetime,  as  illustrated  by  the  rcflectivjty  re¬ 
sponses  in  Fig.  Ub)  for  strained  800  A  SiGe 
tilms  grown  by  RFCV’D  (bottom)  and  .MBE 
(middle).  The  top  trace  in  Fig.  Ub)  was  ob¬ 
tained  from  the  same  t  =  0.5  sample  as  the 
corresponding  Fig.  1(a)  trace  after  several 
months  of  room-temperature  storage.  A  com¬ 
pletely  different,  small-amplitude,  two-com- 
ponent  response  is  now  observed,  suggesting 
alloy  separation  into  local  regions  of  pure  Si 
and  Ge.  However,  no  such  changes  occurred 
in  the  other  samples. 

Figure  2  (top  trace)  shows  the 
femtosecond  oblique  incidence  response  of 
an  optically  thick  film  grown  by  sublimating 
unpunfied  powder  onto  glass.  X-ray  dif¬ 
fraction  showed  oriented  epitaxial  fee  crystal¬ 
line  structure.''  An  initial  1-ps  recovery  is 
followed  by  a  steady  state,  which  we  tenta¬ 
tively  interpret  as  ultrafast  relaxation  of  ini¬ 
tially  hot  electron-hole  pairs  to  long-lived, 
1,5-eV  direct-gap  band-edge  states  at  the  X 
point,^  The  x-ray  diffraction  of  other  grown 
films  indicated  amorphized  Qo-  If*  these 
cases  the  femtosecond  response  shown  in  the 
bottom  trace  of  Fig  2  was  observed  .-Vl- 
though  the  first  !  ps  is  similar,  continuing 
relaxation  is  observevf  instead  of  a  steady 
state,  suggesting  that  amorphization  induces 
rapid  nonrad iative  recombination  channels. 

IliM  Wnison  Research  Ceiifer,  P.O.  Box  213. 
Yiirktatvn  Hfig/rt>,  NiW  York  10398 

1  VV  Kutt  et  ai,  "Nonlmear  optical  prob¬ 
ing  of  surface  defects  in  semiconduc¬ 
tors  with  femtosecond  pulses,"  in 
Qmintiiin  EU-Ltrouics  and  Laser  Science 
Con'erena'.  Vol.  11,  1991  Technical  Di¬ 
gest  Series  (Optical  Society  of  America, 
Washington  IX),  1991),  paper  QVVD17, 
p  142;  A  Esser  et  ai,  Appl.  Surf.  Sci. 

46.  446  (1990);  u.  C.  Cho  el  at.,  Phys. 
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QThDl  Fic.  1.  0.2-|im  r--Si  ?ample;  (a)  Re¬ 
fractive  index  -as  a  funci:  .r.  cf  temperature, 
(*)  \  ~  0.r?2  um;  (O)  >.  =  i  t?  um.  (b)  Extinc¬ 
tion  coefficient  as  a  function  of  temperature; 
{+)  X  =  0./52  um:  (O)  X  =  M5  pm. 


QTHDl  Fig,  2.  1 .0-pm  a*Si  sample;  <ai  Re¬ 
fractive  index  as  a  function  of  temperature, 
(+)  X  =  0,752  pm;  (O)  X  =  1.15  pm  (b)  Extinc¬ 
tion  coefficient  as  a  function  of  temperature; 
(+)  X  =  0.732  pm;  (O)  X  =  1,15  pm  solid  curve 
is  a  quad  ratic  polynomial  fit  for  X  =  0.752  pm 


OThDl  Fig.  3.  Optical  gap  eriergv  t.  as  .r 
function  of  temperature  for  Si  at  X  =  0  752  u.m, 
(+)0.2-pm  a-Si  sample  10)  1.0-pma-Si  xamrle 
(O)c-Si. 
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QThD2  Fig.  1 .  Femtosecond  reflectivity  re¬ 
sponse  of  the  SiGe  alloys.  Numbers  represent 
the  Ge  content  in  each  alloy.  The  reflectivities 
at  time  delay  zero  are  displaced  for  claril  v.  (at 
Relaxed  optically  thick  SiGe  alloys  frcshl\- 
grcnx  n  hv  MBE,  (b)  Femtosecond  refipctivii'. 
response  of  the  strained, opticallv  thm  alloys 
The  307r  alloy  is  the  same  sample  as  in  to) 
after  6  months. 


QThD2  Fig,  2.  Ferntosecond  reflei  tmtx  re¬ 
sponse  of  the  Qo  films  using  obliqi  1 1  ■  i lu ; o e;’, ; 
angle  probe;  (a)  crystalline  film  oi  gl.ixs. 
(b)  amorphusized  Cm  film  on  gh“- 
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QThD3  Diffuse  transmission 

spectroscopy  of  structure 
In  dense  colloids 

P  D  Kaplan,  A.  C.  Yodh,  D.  J.  Pine 
Dcj'artment,  Ur.r.ertUu  of 

209  S.  35‘.:  Sl'c-:!,  Philadelphia, 
,‘\-.i:<i,I:'.ini,'i  19109 

Multiple  light  scattering  spectroscopies  are  of 
growing  importance  to  the  laser  community 
tveause  of  their  potential  to  provide  irdorma- 
lion  on  dense  random  media,  and  because  the 
behavior  of  diffuse  light  is  interesting  in  its 
own  right.  In  this  contribution,  we  show  that 
it  is  possible  to  derive  quantitative  informa¬ 
tion  about  the  microscopic  structure  of  a 
dense  colloid  by  measuring  its  transmission 
.1-  .1  lunction  of  wavelength. 

i  'hoton  transport  in  isotropic  random  sys- 
such  as  dense  colloids,  is  characterized 
by  two  quantities:  the  photon  random-walk 
step  length  l'  and  the  photon  absorption 
length,  it  has  been  previously  recognized  that 
/’  is  related  to  an  integral  over  the  material 
interparticle  structure  factor,  Sfij).’  This  fact 
has  been  used  to  correct  for  the  effects  of 
particle  structure  in  diffusion  measurements, 
but  generally,  the  effect  is  viewed  as  a  nui¬ 
sance.  We  demonstrate  by  experiment  and 
theory  that  measurements  of  the  wavelength 
liependencc  of  /*  can  provide  information  with 
high  sensitivity  on  the  structure  factor  of  the 
dense  colloid. 

To  visualize  the  connection  between  inter¬ 
particle  structure  and  photon  transport,  con¬ 
sider  a  photon  impinging  on  a  small  group  of 
particles  I  Fig.  Uall.The  probability  of  scatter¬ 
ing  through  wave  vector  <j  is  given  bv  the 
product  of  the  individual  particle  form  factor 
K  Ml  interparticle  structure  factor 

S(i;).  The  form  factor  depends  on  wave  vector 
kg  as  welt  as  particle  diameter  a.  In  multiple 
scattering  experiments,  this  type  of  scattering 
occurs  many  times,  and  the  formal  expres- 
-.lon  fur  l/f*  is  proportional  to  an  integralover 
r-i.j)  with  a  weighting  function  .jV^^fq),  i.e., 

f  qVF,^{qa)S[aa)d(qa), 

where  p  is  the  particle  number  density.  Dif¬ 
fuse  transmission  spectroscopy  works  be¬ 
cause  the  upper  cutoff  of  this  integral  (2k;fl) 
depends  on, the  photon  wavelength.  Thus, 
the  value  of  /'  at  every  wavelength  is  an  aver¬ 
age  over  different  parts  of  the  structure  factor. 

In  this  experiment,  we  measure  the  wave¬ 
length-dependent  transmission,  between  450 


and  900  nm,  of  an  optically  dense  slab  ot 
polystyrene  spheres  (essentially  hard 
spheres)  suspended  in  water.  The  light  is  col¬ 
lected  [Fig.  1(b))  using  an  integrating  sphere 
and  a  photodiode.  The  data  in  Fig.  2  were 
taken  from  a  cell  with  volume  fraction  0  * 
0.33,  a  »  0.460  pm,  and  cell  thickness  L  =  500 
pm.  The  transmission  coefficient  T  derived 
within  the  diffusion  approximation  is  pro¬ 
portional  to  I'}  T  ^  51  /3L  »  0(/'/L)T  The 
solid  line  in  Fig.  2  shows  a  theoretical  predic¬ 
tion  for  T  using  the  Percus-Yevick  approxi¬ 
mation  to  calculate  the  structure  factor  and 
.Mie  theory  for  the  form  factors.  The  dashed 
tine  is  calculated  withS(ii)  a  1.  Clearly.diffuse 
transmission  spectroscopy  can  test  the  valid¬ 
ity  of  different  models  for  the  material  struc¬ 
ture.  The  weak  oscillations  of  T  with  respect 
to  wavelength  reflect  the  periodicity  of  Siq) 
and  The  detailed  role  of  wavelength  is 

discussed  further  in  Fig.  3. 

In  the  future,  diffuse  transmission  spiec- 
troscopy  will  enable  us  to  study  a  wide  range 
of  systems.  Although  the  system  studied  here 
is  essentially  hard  sphere,  it  may  be  possible 
to  observe  the  fluid/glass/crystal  hard- 
sphere  transitio,  s  in  simibr  colloids. 
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Harrison  M.  Randall  Laboratory  of  Physics, 
Uniivrsity  cf  Michigan.  Ann  Arbor, 

Michigan  48109-1120 

The  presence  of  strong  magnetic  fields  signif¬ 
icantly  modifies  electronic  stales  of  a  semi¬ 
conductor,  such  as  quantization  of  energy 
levels  (Landau  leveb)  and  shrinkage  of  exci- 
ton  wave  functions.^  Consequently,  signifi¬ 
cant  changes  in  optical  as  well  as  transpwrt 
properties  of  the  material  are  expected.  In  this 
paper,  we  report  observations  of  an  enhanced 
nonlinear  optical  response  and  a  dranuiiic 
reduction  of  mobilities  for  magnetoexcitons 
in  GaAs.  We  also  discuss  nonlinear  optical 
measurements  of  excitons  associated  with 
higher  Landau  levels. 

Our  measurements  are  based  on  frc; 
quency-domain  four-wave  mixing  (FWM),* 
and  are  carried  out  as  23  K  on  0.2-pm  thick 
GaAs  film  grown  by  molecular  beam  epitaxy. 
Al  zero  field,  the  sample  Is  homogeneously 
broadened,  and  has  a  1-s  exciton  absorption 
line  width  of  0.2  meV  as  shown  In  1(a). 
The  light  heavy-hole  degeneracy  is  lift^  by 
strain  introduced  during  the  etching  process. 
The  linear  absorption  displays  a  rich  struc¬ 


ture-  lit  6  T  (■rt.i*  l  ig  i(bi|  Ihv  le,injrt-^  are- 
likely  the  combined  rc*sult  of  strain  and  im¬ 
purity-bound  excitons.  However,  the  noniin- 
e<rr  optlc.il  response  [Fig.  l(c)J  obtained 
through  c  w  degenerate  FW.M  exhibits  a  quite 
different  structure.  The  energy  separation 
and  the  magnetic  field  dependence  of  the 
dominant  p.-aks  in  the  nonlinear  sjiecirviin 
suggest  that  these  peaks  are  due  to  mag- 
netoexcilons  assuciated  with  Landau  lest-is, 
indicating  a  significant  enhancement  of  the 
excilonic  nonlinear  magneto-optical  re¬ 
sponse  above  the  heavy-hole  exciton  coin- 
pared  to  the  nonlinear  response  at  0  T. 

The  exciton  diffusion  coefficient  is  ob¬ 
tained  using  standard  nearly  degenerate 
FWM  metheids  in  which  grating  decay  rates 
are  measured  as  a  function  of  the  grating 
spacing.^  Fig.  2(a)  and  (b)  display  nearly  de¬ 
generate  FWM  line  shapes  obtained  at  two 
different  tield  strengths.  Tlie  width  of  the 
response  is  given  by  P  =  y  =  4n*D/A^,  where 
V  is  the  excii.Tion  decay  rate  and  A  is  the 
grating  spacing.  Dramatic  field-induced 
changes  in  the  grating  decay  rate  are  evident 
in  Fig.  2lb).  Figure  2(e(  is  a  plot  of  the  mag¬ 
netic  dependence  of  the  exciton  diffusion  co¬ 
efficient  which,  at  1  T,  is  reduced  by  a  factor 
of  5  compared  to  the  value  at  zero  field.  The 
reduction  of  the  exciton  mobility  is  likely  the 
result  of  the  smaller  Bohr  radius  in  the  pres¬ 
ence  of  a  magnetic  field.  In  this  case,  exciton 
scattering  by  weak  or  small-scale  disorder 
may  become  important.  .Vote  that  the  smaller 
grating  divay  rates  also  lead  to  enhanced 
nonlinear  optical  responses  for  mag¬ 
netoexcitons 

Nearly  degenerate  RV.M  measurements 
on  excitons  associated  with  the  second  and 
third  Ltndau  levels  also  reveal  interesting 
phenomena.  First,  decay  of  the  cw  nonlinear 
response  is  of  the  order  of  the  exciton  recom¬ 
bination  time.  Second,  measurements  using 
selective  optical  excitation  show  that  the  non¬ 
linear  signal  arises  from  excitons  associated 
with  the  o'  transition.  If  the  nonlinear  optical 
response  is  primarily  due  to  phase-space  fill¬ 
ing  and  exchange  effects,  the  first  obserx-ation 
would  imply  that  the  inter-Landau-lexrel 
transfer  rate  is  slower  than  or  comparable 
with  the  exciton  recombination  rate.  If  the 
inter-Landau-level  interaction  is  responsible 
for  the  nonlinear  optical  response  of  excitons 
associated  with  high  Landau  levels,  the  sec¬ 
ond  observation  ivould  suggest  a  spin-de¬ 
pendent  interexciton  interaction.  Further 
studies  including  magnetoexciton  photolu¬ 
minescence  and  cw  pump-probe  measure¬ 
ments,  which  are  helpful  in  understanding 
current  nonlinear  measurements,  are  also 
presented. 
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Tony  F.  Heinz,  IBM  Watson  Research 
Center,  Presider 
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QF81  Ultrafast  electron 
dynamics  and 
recombination  on  the 
Ge{1 11)2x1  n-bonded 
surface 

Richard  Haight,  Martina  Baeumler 
IBM  Watwn  Research  Cet.Ur,  F.O.  Box  218. 
Yorktoti’n  Heights,  New  York  10598 

Elucidation  of  the  microscopic  scattering  and 
energy  loss  processes  involved  in  ultrafast 
electron  dynamics  is  crucial  to  the  under¬ 
standing  of  macroscopic  nansport  in  semi- 
conductoi  systems.  In  a  semiconductor 
heterostructure  system,  for  example,  an  elec¬ 
tron  that  crosses  the  interface  between  the 
two  materials  will  scatter  from  the  band  min¬ 
imum  of  the  substrate  to  a  new  band  in  the 
overlayer,  a  process  that  may  be  complicated 
by  interfacial  disorder  and  interddfusion. 
The  surface  of  a  semiconductor,  in  this  case 
the  cleaved  Gedll)  2  x  1  surface,  provides  a 
particularly  interesting  model  system  for  the 
study  of  such  processes.  Upon  cleavage,  one- 
dimensional  chains  of  >t-bonded  atoms  are 
formed  whose  electronic  states  disperse  into 
the  bulk  band  gap  of  Ge.  The  processes  that 
couple  bulk  excit^  electrons  into  statesat  the 
surface  are  operative  in  a  wide  class  of  sys¬ 
tems. 

We  used  angle-resolved  laser  photoemis¬ 
sion  techniques  to  examine  the  ultraiast  elec¬ 
tron  scattering  and  recombination  processes 
on  the  Ge(111)  x-bonded  surface  with  sub¬ 
picosecond  time  resolution.  Electrons  pho- 
toexcited  into  the  bulk  Ge  conduction  band 
scattCT  into  the  unoccupied  surface  antibond¬ 
ing  Jt  band  whose  minimum  is  at  the/ point 
in  the  surface Brillouin  zone.  Rapid  relaxation 
to  the  surface  band  minimum  is  followed  by 
a  unique  phonon-assisted  process  in  which 
electrons  recombine  with  bulk  holes  at  the 
valence  band  maximum,  which  we  fine  to  be 
the  primary  mechanism  responsible  for  the 
decay  of  the  transient  x  population.  Time-de¬ 
pendent  measurements  at  300  K  and  120  K 
have  been  performed  to  determine  the  role  of 
energetic  phonons  in  the  scattering  processes. 
These  processes  are  modeled  with  a  set  of  rate 
equations,  whose  fits  to  the  data  yield  scatter¬ 
ing  times  used  to  directly  determinea  surface 
recombination  velocity.  Ultrafast  surface- 
state  hole  dynamics  are  observed  and  a 
renormalization  of  the  surface  band  gap  is 
studied  as  a  function  of  electron  density.  Since 
the  R-bonded  states  are  fundamentally  one- 
dimensional  In  nature,  Ihese  results  represent 


the  first  such  studies  of  bandgap 
renormalization  m  a  one-dimensiunai  sys¬ 
tem. 

Fraunhofer  Institute  fur  AngewanJte 
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Physics  Department,  University  of  Texas. 
Austin,  Texas  78712 

With  the  advent  of  high-mtensity 
femtosecond  sources  has  come  the  ability  to 
excite,  on  a  subpicosecond  time  scale,  the 
electrons  in  a  solid  to  temperature  T,  greatly 
in  excess  of  the  lattice  vibrational  tempera¬ 
ture  T|.'  In  a  previous  experiment  on  Ak.‘  it 
was  shown  that  non-«quiiibnum  heating  of 
the  electrons  leads  to  subpicosecond  therm¬ 
ionic  emission,  but  that  space-<harge  fields 
distort  the  iiutial  thermal  distribution.  Here, 
we  achieve  quantitative  understanding  of  the 
non-equilibrium  thermionic  emission  pro¬ 
cess  by  simultaneously  measuring  the  reflec- 
tiv-ity  of  the  incident  Ught,  the  total  electron 
yield,  and  electron-energy  distribution 
curves  (EDCs)  of  the  emiti^  electrons.  The 
results  are  modeled  with  an  analytic  Kichard- 
son-Dushman  type  equation  and  with  parti¬ 
cle  simulation  codes  which  include 
space-charge. 

The  non-equilibrium  excitation  is 
achieved  with  lO-Hz  YAG-pumped,  multi 
stage  amplification  of  2.0-eV  colliding-pulse 
mode-locked  (CPM)  pulses.^  Average  inten¬ 
sities  in  each  80-fs  pi^  a,o  varied  up  to  4.3 
X  10'‘  W/cm^  (35  mj/cm^  fluencc)  on  the 
sample  surface.  Total  yield  is  determined  by 
measuring  the  average  current  through  the 
sample  to  ground,  whereas  the  electron-en¬ 
ergy  distributions  are  obtained  from  tinie-of- 
flight  (TOF)  spectra  measured  with  a 
multichannel  plate  coupled  to  a  transient  dig¬ 
itizer.*  All  data  reported  here  were  taken  with 
the  laser  intensity  kept  below  the  cumulative 
damage  threshold. 

We  find  that  the  temperature-dependence 
(deduced  calorimetricaily  from  the  absorp¬ 
tion  of  the  metal)  of  the  total  yield  is  in  go^ 
agreement  with  a  space-charge-limited  ex¬ 
tension  of  the  Riefuudson-Dushman  equa¬ 
tion  for  thermionic  emission.  For  a 
square-wave  temperature  pulse  of  duration 
T„  the  totjJ  number  N  of  emitted  electrons  is 
given  by:’ 

N-409Ro(kBT,)lnn  +7  76 

xlo‘Ro^(kBT^e'*»^*^') 

Here  Rq  is  the  radius  (in  microns)  of  the  spot 
being  heated,  EgT^and  ^aie  the  temperature 
and  work  function,  respectively,  in  electron 
volts.  For  reasonably  hi^  temperatures  (>0.2 
eV  for  a  typical  metal),  the  yield  is  very  nearly 
linear  with  temperature.  Figure  1  has  plotted 
the  measured  and  theoretical  Eq.  (1 )  yields  for 
A1  (110)  using  a  measured  value  of  Ro  =  15ft 
)im  and  assumed  values  of  to  =  O.o  ps  and  % 


4  eV  .Sure  that  ihe  yield  is  not  very  sensitive 
to  the  last  two  values  since  they  appear  inside 
the  log  term,  and  hence  it  is  a  simple  and 
direct  measoreof  the  r.on-equilibnum  rise  T, 
rise  gixxl  itgreemeni  between  t.He  two  turves 
and  the  linear  behavior  of  the  experimental 
results  for  Ihe  tughest  temperatures  establish 
the  space-charge  dominating  behavior  to  the 
yield  as  described  by  fcq  (1/ 

Comparison  of  the  total  vteld  with  the 
TOF  spectra  indicates  that,  (or  a  given  tem¬ 
perature  increase,  the  electron  distributions 
are  skewed  by  the  space-charge  fields  to  ki¬ 
netic  energies  (KEs)  much  higher  than  the 
corresponding  theimaj  distnbutjon.  This  is 
seen  in  TOF  data  from  Al  !!10),  shown  in 
curves  (a)  and  (c)  of  Fig  2  The  curve  below 
each  dau  set  is  a  Boltzman  distribution  al  the 
same  temperature  T,  as  the  data,  where  T,  has 
been  deduced  from  the  simultaneously  mea¬ 
sured  total  yield  and  Eq  (1)  In  addition  to  the 
electrons  at  much  higher  KEs.  a  sharp  quan- 
tum-exa ration  peak,  most  likeiv  due  to  a  sur- 
late-rvlated  state,  al  a  KE  of  1.5  eV  is  also 
fvidc.nt  a.tprcscr;. particle  siiiiuiatiui,vxxM.x 
are  being  used  to  further  understand  the  mea  - 
sured  thermionic  disinbutions 
Physics  Department,  latorenct  Livermore 
National  Laboralorv.  Livermore,  California 
94550 
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We  h.ive  measured  the  evolution  of  f)ie  elec¬ 
tron  energy  distribution  excited  in  a  300-A  Au 
film  by  a  1 80-fs  optical  pulse.  The  distribution 
is  measured  directly  by  uv  photoemission 
with  a  time  resolution  of  270  fs  and  an  elec¬ 
tron  energy  resolution  of  -0.05  eV,  Earlier,  we 
reported  the  first  time-resolved  photoemis¬ 
sion  measurement  of  laser-heated  metals 
with  a  time  resolution  of  -TOOfs.  We  found 
that  the  distribution  function,  especially 
around  0.3  eV  above  the  Fermi  level  deviated 
from  Fermi-Dirac  until  -800  fs  after  the  heat¬ 
ing  pulse.’  Here,  with  better  time  resolution 
we  a  re  now  able  to  observe  (a)  evidence  of  the 
nascent  (as  photoexciled)  electron  distribu¬ 
tion,  (b)  the  time  evolution  from  the  nascent 
to  X  Formi  Dirac  distribution,  and  (c)  that 
thermalization  is  more  rapid  for  liigher  exci- 
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QFB2  Fig.  J.  Total  yield  vs  electron  temper¬ 
ature  for  Aid  10):  •  measured  yield  vs  temper¬ 
ature  rise  T,  deduced  from  the  absorptivity  of 
the  metal .  For  all  data  shown,  the  absorptivity 
i.s  negligibly  different  from  the  low-intensity 
absorptivity  of  Al.  The  solid  line  is  calculated 
from  Ec]  (I) 


QFB2  Fig.  2.  Time-of-flight  spectra  from 
AIdJC)  for  two  different  electron  tempera¬ 
tures  T,.  Curves  (aland  (c)  are  measured  TOF 
spectra  corresponding  to  temperature  rises  of 
0.45  and  0.73  cV,  respectively,  deduced  from 
simultaneous  total  yield  data  and  Eq.  (1). 
Curves  (b)  and  (d)  are  Bolizman  TOF  distri¬ 
butions,  also  corresponding  to  temperature 
rises  of  0.45  and  0.73  eV  respectively.  Also 
labeled  are  kinetic  energies  corresponding  to 
the  various  peaks  in  the  TOF  spectra. 
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'  /it  li  1 1  tl.’\  t'is.  1  hftiC  ri.‘>U  ;£>  lire  ;  t  i  f!\i  rke’ii  Cel  It ' 

iriii  to  cxpenn'icntt-  m  whicn  !enitut.tConJ 
ijiblc  reflection  'jpectro>eopy'  was  ow'd  to 
Jceiuce  the  electron-phonon  constant  cou¬ 
pling  a’  "'th  a  model  that  asiuines  that  the 
electrons  are  thermalieed  much  taster  than 
the  duration  of  the  optical  pulse  These  re¬ 
sults  shove  that  electron  thermalization  oc- 
:urs  on  the  same  time  scale  a> 
■-  iectri'n-phonun  Csiupline;  ar.vi  tnai  electron- 
Lucieo  eiieri^)  transler  is  much  more  compli¬ 
cated  than  previouslv  thuue,nt 

Typical  tinie-resolv  ed  photoenussK'ii 
'pecira  are  shown  in  Fig  1  tor  tiiree  time 
deliO. '  relative  to  the  arrival  ot  the  laser- heat- 
me  piii'C  The  energy  is  referred  to  f'f,  the 
rermi  energy.  The  laser  heating  pulse  was 
generated  by  amplifying  the  output  of  a  tv,  o- 
:et  prism-compensated  synchronously 
pumped  dye  laser  operating  at  TS4  eV'  The 
-.utociirrelation  FWHM  was  14  »  ISO  Is.  The 
,n  paUe  !s32  e\')  wa-  generated  bv  tre- 
oc.i V  -trioiing  the  st  eV  pu'ot  in  tv.m  UBCK 
o,u-  i’he  uv  pulw  f-V'.  HM  u.w  To)  ts, 
deU.iCed  In  observ  ing  a  -ist.-ts  1  '.VHM  crosi- 
vorrelation  w  ith  the  lundaineiitj!  ITiis  dura¬ 
tion  wa^  consistent  with  the  BBO  ervsial 
length  The  heating  and  uv  puUe  were  over¬ 
lapped  on  a  300-A  .Au  film  kept  in  a  uhv 
chamber  The  photoemitted  electrons  vsere 
detected  with  a  time-of-flight  detector  with 
resolution  of  -0.05  eV  at  Ep  in  the  direction 
norma!  to  the  Au  surface  in  a  2'  cone  angle 
The  deviation  from  Fermi-Dirac  is  be*! 
ebsi  reed  on  a  logarithmic  scale.  In  Fig  2,  we 
'll  ■■■.  the  time-evolution  ot  the  lilm  tor  on 
.losi-rfed  tlueiice  01  12U  pi,/cm'  Tlie  best-t.t 
rerir.i-Uirac  tuiiction  is  shown  indashed  tine 
The  data  are  nornialued  with  ri-ipect  to  tiie 
dvr.sitv  ol  states  tor  which  we  have  solved 
self-consistently.  At  l  =  133  fs.  a  plateau  from 
0  7  to  1.8  eV'  reaches  its  maximum  value,  it 
there  vvere  no  energy  relaxation,  the  distribu¬ 
tion  would  be  the  integrated  nascent  distnbu- 
tion,  a  flat  distribution  extending  from  0  to 
1  34  e\'  (the  pump  photon  energy),  with  a 
v  alue  of  0.0028.  VVe  can  easily  see  the  one- 
phoioncutoff  thatisasignature  of  the  nascent 
di'inbution.  The  level  is  within  a  factor  of 
t,.o  il  wh.it  we  d  expect  with  no  energy  rc- 
i.ix.i’.ion  This  observation  suggests  that  the 
eiiercv  relaxation  time  for  the  1.8-eV'  elec¬ 
trons  IS  about  one-half  the  pump  pulse  dura¬ 
tion,  e  g.,  -100  fs.  This  value  is  consistent  vvilli 
energy  range  experiments  '* 

At  t  =  400  fs,  the  Femii-Dirac  temperature 
that  tits  the  data  best  reaches  its  rtiaximum, 
775  K  The  maximum  temperature  calculated 
from  the  heat  capacity,  assuming  no  energy 
transfer  to  the  lattice,  is  770  K.  There  is  still 
iif  j.r„  ir-ately  30%  ol  the  energy  in  the  hot 
tail  Bccau.se  the, energy  relaxation  time 
shi  uld  scale  as  E'~,  we  expect  the  0.9-  and 
7--.-\’  relaxation  times  to  be  ~4i'i0  and  ~o7(' 
r-  w  inch  are  consistent  with  the  data.  Bv  ,5iX’ 
Is.  tile  competition  between  electron  the- 
mializahon  and  electron  energy  transfer  to 
th-c  l.altice  becomes  important.  The  electron 
avstein  loses theenergybeforeh'.iT’  'heTr^i- 
ize:  even  at  1.3  ps  there  is  a  nonthermal  tail. 

In  Fig.  3,  we  show  data  similar  to  that  in 
Fig  2,  except  that  the  ab  orbed  fluence  was 
2.5  X  higher,  300pJ/cm^.  The  plateau  from  0.8 
to  1  84  eV  has  the  same  shape  as  in  the  lower 
tluence  case  and  approximately  twice  the  am- 


phruJe  eoiubteiit  with  the  taiei.vv  i  ht  e.'i 
ergv  relaxation  might  be  taster  tn  this  c.i-. 
biv-iuse  there  are  mote  eievirons  above  r  ^ 
vv  ith  vs  hich  to  scatter  In  this  case,  oniv  ei.-, 
trolls  0!  7d  5  eV  have  to  relax  for  the  disii  ibu- 
tion  to  thermahzf  By  8t>7  rs  the 
thernialiaation  seems  complete  The  etiKtn  ii 
temperature  maximum  calculated  tfom  the 
heat  capacity  is  1200  K  The  jvak  teintvr.iture 
teached  at  400  fs  is  1700  K  ITespite  v-.  ii.i' 
,ippe.i.'s  tvi  be  good  agreement  with,  a  f  eni., 
Thrac  distribution  by  tio7  is.  ti.eekx  iren  teni 
peratures  that  fit  the  data  vlo  iu<t  obe-.  rhe 
standard  electron- phonon  model  e  g 
C^oT^/ot  =  -ytT,  -  T.J  =  -C.  d  T(  t)r  until  r  -- 
Is  This  discrepancy  is  currentlv  being  inv  <> 
ligated  It  may  be  related  to  the  olwerv.ition 
that  the  best  Fermi-Dirac  fit  is  at  a  lemjvra 
ture  much  higher  than  that  expected  fri'm  the 
heat  capaaty 

This  study  shows  that  ihernuli ration  ot 
electrons  in  a  metal  occurs  rebtieelv  sK'wiv 
It  ..h.illenges  the  standard  cUciroti- phonon 
-.  upiing  iiuiJel  It  opens  the  |'^>■.•.|i■l;llv  lii.ii 
,'.:irash4irt  puhixi  excitation  on  tiu-i.i’s  pro 
duces  large  numbeis  of  eliviroiis  vsitti  n.i- 
scent  energy  distribution  lor  -liH)  Is  !:  n-.av 
be  crucial  in  understanding  the  recent  obser- 
vaiion  of  enhanced  chemical  reaction  rates  ol 
absorbed  molecules  on  metal  surtacos  irradi¬ 
ated  with  ultrashort  pulses. 
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QFB4  Heating  and 

hydrodynamic  expansion 
of  intensely  irradiated 
metal  targets  measured  by 
femtosecond 

time-resolved  reflectivity  ' 

,X  Y.  Wang,  H  Y  Ahn,  M.  C.  Downer  / 
P/iysics  Department,  University  of  T exas. 

Auftm,  fexiii  7$P^2 

Intense  femtosecond  excitation  of  metal  tar¬ 
gets  has  attracted  fundamental  interest  in  the 
properties  of  solid-density  plasmas'"’  as  well 
as  practical  interest  in  the  generation  ot  ultra¬ 
fast  vuft  A-»ay  pulsea.  Ill  jc-  cral  le.eiit  stud¬ 
ies,'*  the  ulfrafast  response  of  reflectivity 
from  metal  targets  was  observed  with  a  single 
intense  pulse  serving  both  a,s  target  excitation 
and  as  opbeal  probe  Conseijuentiy,  it  was 
difficult  to  distinguish  the  competing  pro¬ 
cesses  of  electron  heating  and  hydrodynamic 


>v,,  4,4,4  ‘  .....  ..  .,;,,U4..  ,4  li.v  i-;.- 

,  a.-  .'4  !,w  .Mil'll,  111  JeClvx  V  .illU  ,11  dll 

ten  id  hllisw 

VVe  Ic^H..’-!  IelllloM'4. 44.n.a  Ml:4  ic>.ii'4  ed  1  v 

px-rMieiiO  111  wiiun  a  '^.j  w  r./.i.r.-n,  ^’-p-oLir 
i/t'd  piuinp  pulse  4,5  in.  idri  we  e  xCites  Ag  Ai, 
4.1'f  Vv  sutt.Kes  4i!  ; ntelisil ;<'“s  up  to  5  *■  lii 
Vv  .Ill'  1  he  felie.iiv  itv  ul  .4  v\  eak  s- or  p- p4.;'- 
laru'i  .l  p!44i>e  pLii-e  u.,  j.h  r..  e  nP, 

nin  lie  vised  1  l  ,4  '.p'.e.  ... nlefej  .  1I  tile-  pulll’v 
.s  t-'en  111.  lid  d.ir.;lg  tile  puinp-ptuOe 

;.i..e  ele,../  M  II,  ,p-;,.-,''ps  Hlesallipk 

IS  i.isieie  .i  dur.iig  el.ila  ..i;k-e<i.i;i 

4 . 1  n  e- 1  e‘ s.  ■;  V  e-,i  I  -  petia  1  ic  I'C  1  et lee  lie  11 V 
J.it.i  loi  tile  ne.UiV  t.'cx'  eieciodi  metals  Ag 
and  .\1  iw.eiteei  at  'i.i''  Vv ,  eiii'  ate  sheiwn  in 
1-ig  liji  .inj  i.bi  re-Sf'rC(:\  ell  The  tetievlivilv 
sfecreases  wiih  puise-w  jdlh-Iinuied  tali  nnu 
Imm  an  iniii.i!  eaiue  ot  0  86  i.Ag.i  or  0  SO  (Ait. 
reaching  a  tninimuir.  ot  about  0  52  m  .M  -  i 
(w  toliowed  bv  .1  muih  slower  fsartia!  rt'Cuv 
etv  liunng  1  •  .M  <  ’0  ps  Bevxnvd  10  ps 
rv'iit-. *.iv  itv  U'v.ls  .'ii.  v-r  devre.iM-s  siighllv  as 

Ilu- ..l-i.lll.'i'  .  tx'.'. '1144-14  Oplw.olv  lill.  k 

iite  re!w  'iivitv  J.it.i  .ire  atialwed  ..i  t.iin 
lime  vii  i.n  nuiirei  w.i.Sv  ss  iv  ing  the  Heim 
hoi!,'  .ive  e.juaiK'iis  ■  Hie  incdel  .iicluvle' 
iii'l.iiiMiuvi-iis  iie.iting  .ind  s.;bst  uuei!t  cool 
ii’g  ot  tile  eivsitons  and  developmeiil  oi  a 
plasma  gradient  that  expands  at  vonsiani  v  e- 
locitv  normal  tvithe  metal  surt.ue  I  hi'evsilv 
ing  material  ['aranieters  are  ek x  Irvin  .oilisu'i' 
trei-jiieny.'v  v,  plasma  irecjueiicv  ;■  irvl.ited  bv 
me  Drude  dielectric  l unction)  and  exp.inding 
plasm.i  gr.ijienl  sc.ilo  length  I,  Tlie  dotted 
suivv-s  in  I  ig  Ii.dai.J  fbj  were  derived  onlv 
on  the  bases  ot  el.-ctt>di  lie.iting  (/.  s  0)  vieio- 
inc  .1  piiKe-w  idlii  iiiniteii  i.iK  time,  as  ob- 
serve.i  IKiwi  ver,  an  ’inreasonably  long 
eiei  Irvin  reUs.iiii.n  lime  (-)('  ps)  is  rixjuirod 
lv>  III  the  rsMAerv  Alu  rn.lliv  v  Iv,  the  dashed 
s ones  111  1  Ig  l(,i).ind  (bishow  the  reflectiv¬ 
ity  response  oi  Ag  and  Al.  considering 
plasma  exp.insion  alone  at  a  velivcilv  of  ap- 
proximvitely  the  speed  of  sound.  But  this 
model  alone  cannot  account  for  the  rapid  fall 
time  without  invokmgan  unphvsical  plasma 
expanding  speed  >10'  cm.'s  vN'everlheless 
when  both  elfects  are  i.iken  mio  account,  a 
very  satisfying  fit  for  .\g  and  Al  is  obtamed, 
.IS  shown  bv  the  solid  curvn-,  m  Fig  1(a)  ,iiid 
(b). 

rinie-ivsoK  ed  viata  (v.>r  the  nonfree-elec- 
Iron  metal  VV  are  shown  m  Fig  2.  In  contrast 
to  Ag  anvl  Al,  under  the  same  intensity  exci¬ 
tation  lo' '  VV,  cnT  IFig.  2sa)l,  the  reflectivitv 
increases  sli.irpiv  troni  an  initial  value  of  (1  2b 
lo  a  peak  value  of  -0  25,  indicating  conver¬ 
sion  to  .1  more  reflective  metal  (higher  free- 
clectron  density)  upon  melting.  The 
subsequent  rvllectivity  drop  and  recovery 
however,  elosely  resembles  tint  observed  in 
Ag  and  Al,  and  is  siniilarly  expl.imed.  s\i 
lower  111 leiisitv  15  X 10  '  VV.'cm*:  see  Fig  2(b!!. 
the  retloc*''  'tv  only  increases  and  stavs  high, 
indicating  melting  vvith  mmimal  subsequent 
liydrodyn.vniic  expansion  In  all  cases,  s-  po- 
lari/ed  reflectivitv  changes  very  bltle,  in  ap 
proxnnate  igreenieni  with  the  inovj.i 
presented.  Tlie  calculated  curves  in  Fig.  3 
show  a  d liferent  tempsoral  response  expected 
with  k  =  31(1  iim  probe  for  Ai  Experiment 
w  illi  a  uv  probe  are  being  performed  to  v'crifv 
the  proposed  interpretation. 
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QFB3  Fig.  !.  Electron  energy  distribution 
function  vs  energy  for  l<iser-c\citcd  300-.A  Au 
film  for  three  timedel.iys  Tre  film  was  evap¬ 
orated  on  a  sapphire  suh't'.itc  and  was 
cleaned  by  annealing  in  uhv.  The  absorbed 
fluence  from  the  laser  heating  pulse  was  120 
pj/cm^  The  spectra  were  obtained  using  uv 
photoemission.  The  uv  flucnces  were  kept 
low  enough  to  avoid  space- charge  effects. 
Each  spectrum  is  the  accumulation  of  25  000 
electrons. 
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QFB3  Fig.  2.  Electron  energy  distribution 
function  vs  energy  for  Au  film  with  120- 
iij.^cm^  absorbed  laser  fluence  at  five  time 
delays  The  vertical  Kale  is  in  units  of  the 
density  of  states.  The  artifact  coming  from 
two-photon  Uii  ♦  3a))  photoemission  from  lev¬ 
els  beloi  Epcan  be  no  larger  than  one-half  the 
I  8-cV  signal  at  (  =  0  fs,  e  g.,  <  0.0005  at  /  =  0 
fsand  <0.DX)35by(  =  133  fs.  TheFermi-Dirac 
fits  shown  m  dashed  line  are  fit  with  T,  =  516 
K,  562  K,  706  K,  550  K,  and  470  K,  respectively. 
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QFB3  Fig  3  Electron  energy  dtstnbuiion 
function  vs  energy  for  Au  films  with  .'(«i 
pj/cm*  absorbed  laser  fluence  at  live  nr,  < 
delays.  The  fermi-Dirac  fits  show  n  :n  da'-hi  ,1 
line  are  fit  vMth  T,  r  753  K.  1238  K.  I'V-  K 
1684  K,  and  1274  K,  respect! vciv 


QFB4  Fig.  1.  Time-resolved  re’flectivity  of 
(a)  Ag  and  (b)  A1  surfaces  pumped  by  p-po- 
larized,  90-fs,  620-nm  pulses  incident  at  45' 
from  the  normal  at  peak  intensity  lO'^ 
W/cm",  and  probed  by  ;  -rolari/ted  pulses 
(data  in  squares)  of  same  duration  and  wave¬ 
length  incident  at  ^9  \  Each  data  point  Ls  av¬ 
eraged  over  20  shots  Curves  represent 
theoretical  models  based  o."  numerical  solu¬ 
tions  of  Helmholtz  eq.nuons,  as  follows: 
electron  heating  include. 1  rrasimum  reflec¬ 
tivity  drop  shc-.T.  rorre^ronds  to  u'w  from 
0  06  fo  1.12  for  Ac  and  from  0  35  to  2.41  for 
Al),  plasma  expansion  neglected  (dotted 
curves);  plasma  expansion  included  (with 
sound  speed  of  1 .17  »  IC'  cm  s  for  Agand  1,5 
X  10*’  cm/s  for  Al),  electron  heating  neglected 
(dashed  curves):  electron  Heating  and  plasma 
expansion  both  included  i=.-/:d  curves) 
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QFB4  Fig.  2.  Time-resolved  reflectivity  of 
IV  under  identical  conditions  as  for  Agand  Al 
at  peak  intensity  (a)  lO'^  W/cnr  and  (b)  5  x 
10^  W/cm^.  A  numerical  fit  of  reflectivity 
evolution  for  W  isdifficult  because  the  Drude 
model  may  not  be  easily  applied  with  com¬ 
plicated  initial  transition  metal  features. 
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QFB4  Fig.  3.  Theoretical  curves  by  extrap¬ 
olating  the  parameters  u(f),  Wpff),  L(!)  used  to 
fit  the  data  as  in  Fig,  1(b)  for  Al,  indicating 
significantly  different  temporal  response  of 
reflectivity  for  probes  of  620  nm  and  310  nm 
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QFB5  Generation  of 

high-current-density, 
subpicosecond  electron 
bunches  from  Ag  film  and 
Cu  mirror  using  amplified 
CPM  laser  pulse 

Trieeni  Srinivasan-Rjo,  T,  Txii.g,  J.  Fischer 
ii'/ii'iitiitioit  Division.  Hr.,.  -  •  .r.i'ii 

',  K.il  5x~}-B.  1.’: '  .Vc;:'  I'l'ie 
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Electrons  generated  by  ultrashort  laser  pulses 
have  pro\  en  to  be  excellent  sources  for  high- 
hrightness  electron  guns,'  much  sought  after 
tor  short-wavelength  FEL,  and  high-energy 
.icceleraiors.  The  brightness  or  the  e-beam  can 
he  increased  by  tnereasing  the  current  and 
-teCrea.sing  the  emittance.  We  ecplore  the  lea- 
-.'rii'.tv  ot  increasing  the  current  and  the  cur- 
ric:  i.ien';t\'.  using  ph-.'tueniissioii  with 
-ubpicosecond  laser  pulses  on  metal  cath- 

The  laser  system  used  in  these  studies  is  a 
dividing  pulse  mode-locked  (CI’M)  dye  laser 
loilowed  by  a  nitrogen-lascr-pumped  dye 
.'Miplifier  generating  up  to  3  gj  energy  with  a 
pulse  duration  of  280  fs  at  2-e\'  photon  en¬ 
ergy.  This  beam  is  focused  onw  the  cathode 
to  a  spot  sue  of  60  x  90  pm"  FWHM,  The 
a;  node  is  either  a  copper  mirror  or  a  thin  Ag 
:,iir,  held  parallel  to  the  anode,  at  a  pressure 
'll)  ’  Torr.  The  electrons  leasing  the  cath- 
.-de  are  measured  using  a  ch.arge-sensitive 
preamplifier  and  a  shaping  amplifier,  and  are 
displayed  on  an  oscilloscope. 

The  intensity  dependence  of  the  current 
densiw  from  copper  upon  irradiation  by  the 
amphtied  CPM  is  shown  in  Fig.  1  (-r).  Since 
the  work  function  of  copper  ts  -4.3  eV,  multi- 
photon  process  involving  at  least  two  pho¬ 
tons  ts  expected.  However,  contrary  to  this 
vvpectation,  the  slope  of  the  line  through  the 
i.it.i  is  1,  implying  a  singlts-photon  process. 
Till,-  maximum  current  densuy  delivered  by 
u.e  caiitode exceeds  lUk A  /  cm",  with  a  i.;ua il¬ 
ium  citici'.ncy  of  2  x  10"  .  No  surface  damage 
lias  been  observed  on  the  sample  up  to  laser 
intensities  of  2  x  lO"  VV/cm". 

Intensilv  dependence  of  photoemission 
from  ,Ag  film  for  both  the  unamplified  and  the 
amplified  CPM  are  also  sho'en  in  Fig.  1.  In 
keeping  with  the  energv  required  to  over¬ 
come  the  work  function  of  silver,  two- photon 
photoemission  is  observed  in  both  the  inten- 
=11  ■'  regimes,  and  the  consistency  between  the 
-.i.ita  in  the  two  regimes  .s  .viy  good.  The 
"riximum  current dcn.sUv observed  with  this 
-  pie  1-  .3  kA./cm".  No  -uriace  damage  to 
::.f  ,-\g  tii.-n  was  observed  .mer  irradiating 
'-Mth  laser  intensities  oi  -!o''  W/ern".  The 
a.jtocorreiation  signal  of  the  amplified  laser 
beam  and  the  e-beam  are  sh.  ■  ■■.  n  in  Fig.  2  The 


eUctton  pulse  appears  to  be  25‘T.  longer  than 
the  laser  pulse  This  dtscrepanev  could  be 
attributed  cither  to  the  nonumform  heating  ot 
the  electrons  or  the  low  signal-to-noise  rjiw> 
jii.i  tiuctuation  in  the  electron  signal,  ai.d  > 

,  jrrentiv  under  investigation  In  Fig  3.  li  e 
cr.arge.  .Q  is  plotted  against  the  extiacuon 
Held.  to  illustrate  the  magnitude  oi  the 
Schottky  effect  on  photoemission  itom  a 
smooth  film  for  various  input  energies  The- 
oretwallv,  for  a  txvo-photon  prcicess,  the  slope 
iii  me  line  should  be  proportional  to  .4  - 
vx  here  A  contains  material  constants  and  I  is 
the  intensity  of  the  laser.JThe  data  indicate 
that  the  slope  changes  as  xf.  This  discrepancy 
could  be  due  to  the  dependence  of  the  mate¬ 
rial  constants  on  the  electron  and  lattice  tem¬ 
perature  and  hence  the  laser  intensity 

In  conclusion,  subpicosecond  electriin 
hunches  of  current  densities  exceeding  10 
kA  /cin*'  has  been  generated  Since  no  uptn.ii 
damage  was  observed  so  far,  extraeiing 
higher  current  deirsities  are  feaeible,  if  huger 
extraction  fields  are  available  Sealing  I., 
larger  eurre’nts  is  also  possible  by  using  nieire 
energetic  laser  beams  of  comparable  intensi¬ 
ties 

1  H.  G.  Kirk  et  a/.,  "Experimental  results 
from  the  BN'L  ATF  photocathode  gun, 
to  be  published  in  Proc  Tliirhvnih  luu  i 
li.ltl.iililt  Fw-FJcctroil  falser  Ce'ii/i  icn.i  , 
S.uita  Fe,  Ncie  .Mexico,  Aug  23-,10. 
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QFB6  Electronic  disordering 

during  femtosecond  laser 
melting  of  GaAs 

P  N.  Saeta,  J.-K.  Wang,  Y.  Sicgal.  N 
Bloembergen,  E.  Mazur 
Conloii  McKay  Laboratory,  Ffarvartl 
Lliiitvrsity,  Cambridge.  MaSMihun-tts  iCl  5a 

,\n  intense  femtosecond  pulse  of  visible  light 
incident  on  the  surface  of  a  crystalline  semi¬ 
conductor  can  depiosil  energy  in  the  elec¬ 
tronic  system  in  a  time  that  is  short  compared 
to  the  electron-phonon  coupling  lime.  II  the 
density  of  deposited  energy  is  high  enough, 
a  layer  of  material  at  the  surface  may  be  able 
to  disorder  before  appreciable  energy  is 
transferred  from  the  electrons  to  the  ions  of 
the  lattice.’  Pumpt-probe  experiments  on  sil¬ 
icon  using  90-fs  pulses  showed  a  rise  in  retlex- 
tivity  and  a  decay  in  second-h.iriiHiiiK 
generation  (SHC)  consistent  with  the  iiielting 
of  the  surface  layer  after  the  excitation 
pulse.*'"'’  Because  the  intensity  of  the  reflected 
second-harmonic  signal  depends  on  crystal¬ 
lographic  orientation,  its  decay  serve's  as  a 
sensitive  probe  of  structural  changes.  GaAs 
offers  the  advantage  offers  SI  that  SHG  is 
d  ipole-allowed  in  the  bulk,  so  that  the  su  rface 
contribution  is  negligible  compared  to  the 
strong  bulk  signal.’ 

Pump-probe  experiments  were  per¬ 
formed  with  160-fs  pulses  (FWH.M)  on  a 
xenii-insulating  GaAs(llO)  wafer  in  air.  The 
probe  was  .s-polarized  and  incident  at  4- 
loim  the  normally  incident  pump  beam  i  K  ' 
origin  of  time  was  determined  from  the  --uin- 
trequency  signal  betw'een  the  pump  aiul 


prvbe  beams  fru-  sample  was  iransiatcd  in 
plane  beiw  c-i-n  iasvr  shots,  since  thcexciution 
fiuer.ce  exsc-x-js  the  damage  threshold 

Figure  ■.  show  s  the  fluence  dependence  of 

.riiici'.;  retiv' tivil V  and  second-hat- 
ni.ir,;,  sigii.i.-.  a;  .i  piobs- -delay  ul  i20ts,  nur- 
iv.ai.red  to  tiU  ,r  .uues  willvuut  excitation  At 
,1  tlifc’slHild  111  L  ;  '  ,  cni‘  the  second-harmonic 
;n’,c;.Mt\  b<--girw  '.v:-  drop,  by  0  2  j.-'em"  it  van- 
isl-.es  eieii  at  thus  short  delay.  Above  the 
threshold  ol  01  ).  cm'  the  rellectivity  rises, 
'...nuing  low  ard  a  saturated  value  44 '4  above 
the  unexcited  crvsialUne  caiue  in  Fig  2.  the 
same  signals  are  shciw  n  as  a  l  unction  ol  prohjt 
delay  lor  an  incident  lluence  of  0.33  l/cm* 
The  curves  through  the  data  points  are  expo¬ 
nential  fits  giving  response  times  of  90  and 
170  fs  lor  the  SI  IG  decay  and  rc'flectivity  rise 
recjvctic  eK 

Ihe  high  tetlvciivily  tliat  obtains  5*.?0  Is 
•ilier  exoi.umn  was  compared  to  a  Drude 
model  oj  moiten  Ga.As,  The  observed  44' , 
rise  i>  coiwisioiit  with  ,5C''1  loni/alion  of  la- 
ivnce  ir,'  in  the  excited  phase  Thw 

more  than  .m  laJerc'l  magnitude  greater  ilia: 
the  density  ot  photoexeited  electrons,  8  x  iO" 
cm  ,  based  on  linear  absorplion.  This 
strongly  suggest  an  elt'Ctronic  phase  Iransfor- 
malum.  1  he  sharp  decay  of  SHG  with  a  90-ts 
time  constant  indicates  tK,ii  the  excited  elec¬ 
tronic  stale  Is  a  ceiilros)  iiimetric  one,  in 
whieli  SHC  ;>  dipole  forbidden  Since  the 
time  scale  for  energy  relaxation  from  elec¬ 
trons  to  phonons  in  the  crystal  is  "=  2  ps,”  the 
etectronic  si  stem— and  likely  the  atomic  lat¬ 
tice — disi'iders  beiore  appreciable  energy  is 
tianslorrsxi  to  lattice  \  ibralions. 

Current  .i.idu’s-  ATcrT  Bell  Uihoralones,  oiK) 
Mouiilani  .Avenue.  Murray  Hill,  New  leney 
II797-} 
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Effect  of  barrier  thickness  asymmetries  on  the  eiectricai  characteristics 
of  AlAs/GaAs  double  barrier  resonant  tunneling  diodes 
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We  report  the  impact  of  small  barrier  thickness  asymmetry  on  the  dc  l-V  characteristics  of 
AlAs/GaAs  double  barrier  resonant  tunneling  diodes.  With  a  bottom  AlAs  barrier  6  ML  thick 
and  GaAs  well  18  ML  thick,  the  effects  of  varying  the  top  AlAs  barrier  thickness  from  5  to  8 
ML  produced  significant  changes  in  peak  current  density,  peak  voltage,  and  peak-to-valley 
current  ratio  (PVCR).  PVCRs  of  5.6  were  obtained  on  a  7/18/6  monolayer  structure,  the 
highest  reported  to  date  for  an  AlAs/GaAs  DBRTD  structure. 


I.  INTRODUCTION 

For  high  performance  double  barrier  resonant  tunneling 
diodes  (DBRTDs),  precise  control  of  the  quantum  well 
thickness  and  the  barrier  and  spacer  layer  thicknesses  j 
critical.  Minor  variations  in  any  of  these  layer  thicknesses 
can  have  tremendous  impact  on  the  resulting  dc  !-V  pa¬ 
rameters.  such  as  the  peak-to-valley  current  ratio 
(PVCR).  the  peak  current  density  {Jfl,  the  difference  be¬ 
tween  the  peak  voltage  and  the  valley  voltage  (AF),  and 
the  difference  between  the  peak  current  density  and  the 
\  alley  current  density  (\J). 

Previous  studies  of  barrier  thickness  asymmetries  in 
DBRTDs  have  dealt  with  thick  AIGaAs  barriers  which 
varied  by  many  monolayers.'  '  Asymmetries  in  dc 
current-voltage  characteristics  have  also  been  observed  in 
non.inally  symmetric  structures,  where  the  cause  has  been 
attributed  to  interface  roughness  from  the  inverted  inter¬ 
face,  GaAs  on  AIGaAs  or  GaAs  on  AlAs.'*’  Here  we 
present  the  effects  of  intentionally  grown  barrier  thickness 
asymmetries  on  thin  barrier,  high  current  density  AlAs/ 
GaAs  DBRTDs.  Experimental  measurements  are  also 
compared  to  simulation  results  using  a  self-consistent  cal¬ 
culation  that  couples  the  Schrodinger  equation  in  the  quan¬ 
tum  well  region  to  the  drift-diffusion  equation  in  the  rest  of 
the  device.** 

II.  EXPERIMENTAL  PROCEDURE 

The  DBRTDs  were  grown  in  a  Varian  Gen  II  molecular 
beam  epitaxy  (MBE)  system  using  n’  (2x10'**  cm  ’) 
silicon-doped  (100)  liquid  encapsulated  Czochraiski 
(LEG)  GaAs  substrates.  On  all  structures,  the  substrate 
temperature  was  kept  at  600 °C  with  a  0.5-1.0  ^m  n" 
(4  x  lO''  cm  ')  GaAs  buffer  layer  grown  at  a  rate  of  I 
ML/s.  Three  spacer  layers  surround  the  quantum  well, 
consisting  of  50  A  of  nominally  undoped  GaAs,  100  A  of 
5  lO'**  cm  '  /T-type  GaAs,  and  100  A  of  6x  lO'^  cm  ’ 

n-type  GaAs.  An  (4  X  10"*  cm  ')  GaAs  cap  layer  was 
grown  on  all  samples  to  facilitate  ohmic  contact  formation. 
All  growth  rates  were  calibrated  using  reflection  high-en¬ 
ergy  electron  diffraction  (RHEED)  intensity  oscillations, 


immediately  before  and  after  actual  growth  of  the  DBRTD 
devices.  The  AlAs  growth  rates  used  were  0.25  or  0.30 
ML/s,  while  the  growth  rate  for  the  GaAs  spacer  layers 
and  GaAs  well  was  0.4  ML/s.  Growth  was  interrupted  for 
4  s  at  each  heterointerface.  For  all  devices,  the  bottom 
AlAs  barrier  and  GaAs  well  growth  times  were  set  to  pro¬ 
duce  nominal  layer  thicknesses  of  6  and  18  ML.  respec¬ 
tively.  The  AlAs  top  barrier  growth  time  was  altered  for 
the  different  DBRTDs  to  produce  nominal  layer  thick¬ 
nesses  of  5.  5.5.  6,  6.5.  7,  and  8  ML. 

The  device  structures  were  mesa  isolated  using  an  8;l;l 
H;S04;H20cH^0  etch.  The  top  Ni/AuGe/Ni  contact 
metallization  was  annealed  at  450  'C  for  .30  s.  The  backside 
ohmic  contacts  were  formed  by  the  indium  metallization 
used  to  mount  the  GaAs  substrates  to  the  molybdenum 
bltKks  during  MBE  growth. 

The  devices  were  tested  using  a  Keithley  2.30  program¬ 
mable  voltage  source  and  a  Keithley  195A  digital  multi¬ 
meter  controlled  by  an  IBM  PC- AT.  The  devices  ranged 
from  3  to  7  /sm  in  diameter. 

III.  RESULTS  AND  DISCUSSION 

Experimental  and  simulation  results  are  summarized  in 
Fig.s.  I  and  2.  In  our  simulalion.s,  an  AlAs  effective  mass  of 
0. 1  Im,.  provided  the  best  fit  to  the  experiments.  The  change 
from  the  band  edge  value  of  0.15m,.  can  be  attributed  to 
strong  renormalization  of  the  T  electron  effective  mass  in 
the  AlAs.’  Figure  1  shows  that  for  bia.ses  ("reverse"  bias) 
such  that  electrons  are  incident  on  the  top  AlAs  barrier 
first,  the  peak  current  density  drops  dramatically  as  this 
width  is  varied  from  5  to  8  ML.  For  electron  injection 
through  the  fixed-thickness  bottom  barrier  first  (“for¬ 
ward”  bias),  the  variation  is  significantly  less.  Figure  2 
shows  similar  behavior  for  the  peak  voltage  in  forward  and 
reverse  bias.  Both  our  simulation  and  experimental  results 
show  that  thickness  variations  in  the  entry  barrier  have  a 
much  larger  impact  on  dc  /-F  characteristics  than  thick¬ 
ness  variations  in  the  exit  barrier. 

Figure  3  shows  that  structural  asymmetries  also  have  a 
significant  impact  on  peak-to-valley  current  ratio.  Signifi- 
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Fui  1.  Peak  current  density  vs  top  AlAs  barrier  thickness  for  both  sim¬ 
ulation  and  experimental  results  Electron  injection  through 

variable  thickness  top  AlAs  barrier  first;  (  -  -  O-  - ).  Simulation  results  for 
electron  injection  through  variable  thickness  top  AlAs  barrier  first; 
( — ■ — );  Electron  injection  through  6  ML  bottom  AlAs  barrier  first; 
( — u — ):  Simulation  results  for  electron  injection  through  6  MI,  bottom 
AlAs  barrier  first.  One-sigma  error  bars  are  included  for  experimental 
data 


Fig.  3.  Peak  to  valley  current  raiio  vs  lop  AlAs  barrier  thickness  for 
experimental  results.  ( —  O  — )  .  Electron  injection  through  variable  thick¬ 
ness  lop  AlAs  barrier  first,  i  -  Electron  injection  through  b  ML 

boliom  AlAs  barrier  first  One-sigma  error  bars  are  included  for  this 
experimental  data 


cantly  higher  PVCRs  were  observed  when  the  electrons 
passed  through  the  thicker  barrier  first,  regardless  of 
whether  the  thicker  barrier  is  on  the  top  or  bottom  of  the 
GaAs  well.  As  the  top  AlAs  barrier  thickness  was  varied 
from  5  to  7  ML,  an  increase  in  the  PVCR  in  the  reverse 
bias  case  (electron  injection  through  the  top  AlAs  first) 
was  seen.  This  trend  did  not  continue,  however,  as  the  top 
Alas  barrier  thickness  was  increased  to  8  ML.  It  is  believed 
that  as  the  thickness  increased  from  7  to  8  ML 
r(!aAs--^AiAs  tunncling  effects  may  become  significant, 
which  tends  to  degrade  the  PVCR.  Other  efforts  to  im¬ 
prove  PVCR  for  AlGaAs/AlAs/GaAs  DBRTDs  have  also 


used  asymmetric  barriers,  producing  PVCRs  of  6.3."*  Our 
7/18/6  monolayer  DBRTD  demonstrated  a  PVCR  of  5.6 
in  reverse  bias,  which  is  the  highest  reported  to  date  for  a 
simple  AiAs/GaAs  DBRTD. 

The  amount  of  dc  /-V  asymmetry  in  each  of  the  devices 
is  summarized  by  taking  the  ratios  of  the  forward  and 
reverse  bias  data,  as  shown  in  Fig. 4.  Interestingly,  the  de¬ 
vice  that  exhibited  the  most  symmetric  characteristics  was 
not  the  nominally  6/18/6  monolayer  DBRTD,  but  rather 
the  5.5/18/6  monolayer  DBRTD.  This  indicates  that  there 
may  be  a  MBE  growth  asymmetry  of  unknown  origin 
which  causes  the  AlAs  barrier  grown  after  the  GaAs  quan¬ 
tum  well  to  be  approximately  half  a  monolayer  thicker 
than  the  barrier  grown  before  the  well. 


4.3  5.3  6.3  7.5  8.3 


Top  AlAo  Barrior  Thicknass  (ML) 

FIG  2.  Peak  voliage  vs  top  AlAs  barrier  thickness  for  both  simulation 
and  experimental  results  Electron  injection  through  variable 

thickness  top  AlAs  barrier  first:  (-  -O-  Simulation  results  for  electron 
injection  through  variable  thickness  top  AlAs  barrier  first;  ( — ■ — );  Elec¬ 
tron  injection  through  6  ML  bottom  AlAs  barrier  first;  ( — G — );  Simu¬ 
lation  results  for  electron  injection  through  6  .ML  bottom  AlAs  barrier 
first.  One-sigma  error  bars  are  included  for  experimental  data 


Top  AlAt  Barrier  Thieknaaa  (ML) 


Fig.  4.  Ratio  of  the  forward  and  reverse  bias  dc  /-  f  parameters  vs  the  top 
AlAs  barrier  thickness.  This  graph  displays  the  asymmetry  in  dc  /-F 
parameters  as  top  AlAs  barrier  thickness  is  varied.  The  device  with  a  5.5 
ML  top  AlAs  barrier  is  shown  to  be  most  symmetric.  ( — D— );  Ratio  of 
the  forward  and  reverse  bias  peak  current  densilies.  ( — ■ — ):  Ratio  of  the 
forward  and  reverse  bias  valley  current  densities  ( — O  — ):  Ratio  of  the 
forward  and  reverse  peak-to-v  alley  ratios  ( — A — ):  Ratio  of  the  forward 
and  reverse  bias  peak  voltages 
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IV.  CONCLUSIONS 

We  have  reported  the  impact  of  small  barrier  thickness 
asymmetries  on  the  dc  I-V  characteristics  of  AlAs/GaAs 
DBRTDs.  By  carefully  monitoring  growth  rates  before  and 
after  each  device  structure  growth  run,  DBRTD  structures 
were  grown  with  nominal  top  AlAs  barner  thicknesses  of 
5,  5.5,  6,  6.5,  7,  and  8  ML.  It  was  seen  that  the  PVCR  is 
significantly  affected  by  the  amount  of  asymmetry  in  bar¬ 
rier  thickness,  with  higher  PVCRs  occurring  when  the 
electron  moves  through  the  thicker  barrier  first,  regardless 
of  whether  the  thicker  barrier  was  on  the  top  or  the  bottom 
of  the  structure.  For  the  7/18/6  DBRTD,  a  PVCR  of  5.6 
was  measured,  which  is  the  highest  reported  to  the  best  of 
our  knowledge.  For  peak  current  densities  and  voltages, 
simulation  results  are  in  qualitative  agreement  with  our 
experimental  results.  Simulation  and  experimental  results 
show  that  thickness  variations  in  the  entry  barrier  have  a 
much  larger  impact  on  dc  /-  V  characteristics  than  thick¬ 
ness  variations  in  the  exit  barrier. 
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We  have  examined  the  influence  of  growth  interruption  on  the  electrical  characteristics  of  thin 
barrier,  high-current  density  AlAs/GaAs  resonant  tunneling  diodes  (RTDs)  grown  by 
molecular-beam  epitaxy.  Interrupt  schedules  were  determined  by  independent  reflection  high- 
energy  electron  diffraction  measurements  of  specular  spot  intensity  oscillations  during 
prototypical  device  growth  sequences.  Our  data  suggests  that  interface  roughness  at  inverted  and 
normal  interfaces  does  not  play  a  significant  role  in  determining  the  perpendicular  transport 
characteristics  of  high  current  density  AlAs/GaAs  RTDs. 


I.  INTRODUCTION 

The  double  barrier  resonant  tunneling  diode  (DBRTD) 
has  been  investigated  extensively  for  device  applications 
such  as  microwave  oscillators.  ‘  ‘  To  improve  the  perfor¬ 
mance  of  these  molecular-beam  epitaxy  (MBE)  grown  de¬ 
vices  both  structural  and  growth  parameters  need  to  be 
optimized.  One  such  growth  parameter  is  the  interrupt 
time  used  at  each  heterointerface  in  the  device. 

investigations  of  the  effect  of  growth  interruption  dur¬ 
ing  MBE  have  focused  mainly  on  the  correlation  between 
interrupt  time  and  quantum  well  photoluminescence 
linewidths'  and  reflection  high-energy  electron  diffraction 
(RHEED)  specular  spot  intensity  oscillations.'*  Interrupts 
of  an  appropriate  duration  during  quantum  well  growth 
have  been  shown  to  result  in  luminescence  spectra  indica¬ 
tive  of  atomically  smooth  and  abrupt  interfaces.  The  effect 
of  growth  interruptions  on  the  low  temperature  electrical 
characteristics  of  thick  barrier,  very  low  current  density 
AlGaAs/GaAs  DBRTDs  has  been  previously  investigated 
by  Gueret  ei  al.'  The  optimization  of  DBRTDs  for  oscil¬ 
lator  applications,  however,  requires  the  study  of  high  cur¬ 
rent  density  devices.  Here,  we  examine  the  influence  of 
growth  interruption  on  the  current-voltage  {I-V)  charac¬ 
teristics  of  thin  barrier,  high  current  density  AlAs/GaAs 
DBRTDs. 

II.  EXPERIMENT 

The  interrupt  durations  used  for  the  DBRTDs  were 
based  on  the  results  of  independent  RHEED  experiments. 
These  consisted  of  measurements  of  the  specular  spot  in¬ 
tensity  oscillations  during  prototypical  device  growth  se¬ 
quences  consisting  of  6  monolayers  (ML)  AlAs,  interrupt, 
18  ML  GaAs,  interrupt,  and  finally  6  ML  of  AlAs.  The 
interrupt  time  was  varied  from  0  to  60  s.  The  GaAs  and 
AlAs  growth  rates  were  0.4  and  0.3  ML/s,  respectively. 
The  As/Ga  beam  equivalent  pressure  ratio  was  30  and 
As/Ga  incorporation  ratio  was  3.1  at  a  substrate  temper¬ 
ature  of  600  "C. 

For  electrical  tests,  four  structurally  symmetric 
DBRTDs  were  grown  at  600  "C  with  different  growth  in¬ 
terruption  schedules.  The  growth  conditions  were  nomi¬ 


nally  the  same  as  those  employed  in  the  RHEED  measure¬ 
ments.  The  layer  schematic  of  the  structure  along  with  the 
interrupt  schedule  are  given  in  Fig  1  Sample  A  is  grown 
without  any  interruption.  For  samples  B  and  C,  growth 
interruptions  of  20  and  60  s.  respectively,  were  employed 
only  after  the  GaAs  layers,  while  no  interruptions  followed 
the  growth  of  the  AlAs  layers.  For  sample  D.  growth  in¬ 
terruptions  of  60  s  only  on  the  AlAs  layers  were  used,  with 
no  interruptions  on  the  GaAs  layers  All  samples,  grown 
on  ( 100)  n  ■  GaAs  substrates,  consisted  of  a  nominally 
undoped  (n  type,  1  X  lO'^  cm  ')  18  ML  GaAs  quantum 
well  sandwiched  between  6  ML  AlAs  barners.  followed  by 
a  three-step  dopant  transition  region  consisting  of  50  A  of 
nominally  undoped  GaAs  adjacent  to  the  AlAs  barriers. 
100  A  of  n  type,  6X  10'“  cm  '  GaAs,  and  finally  100  A  of 
n  type,  4.3x  lO”  cm  ’  GaAs.  Mesa  isolated  diodes  were 
formed  with  Ni/AuGe/Ni/Au  and  backside  indium  ohmic 
contacts  that  were  annealed  at  450 ’C  for  30  s.  The  /-K 
characteristics  were  obtained  for  10-15  devices  per  sample 
with  a  computerized  data  acquisition  system. 

III.  RESULTS  AND  DISCUSSION 

'  he  results  of  the  independent  RHEED  measurements 
are  shown  in  Fig.  2.  This  figure  illustrates  the  evolution  of 
intensity  oscillations  as  the  interrupt  time  between  layers 
was  varied  from  0  to  60  s.  In  this  figure,  the  interrupt  time 
scales  have  been  compressed  to  allow  easy  visual  compar¬ 
ison  of  the  relative  oscillation  amplitudes  of  the  three  lay¬ 
ers.  In  all  cases,  RHEED  oscillations  were  maintained 
throughout  the  entire  growth  sequence.  Also,  very  strong 
oscillations  were  observed  during  the  growth  of  the  first  6 
ML  AlAs  layer.  However,  interruptions  on  AlAs  did  not 
result  in  specular  spot  inten.sity  recovery  with  the  intensity 
remaining  essentially  constant  until  the  GaAs  quantum 
well  growth  was  initiated.  Furthermore,  as  the  growth  in¬ 
terruption  was  increased  on  AlAs.  the  GaAs  grown  on  top 
of  it  became  rougher  as  evidenced  by  a  reduced  oscillation 
amplitude  during  the  GaAs  quantum  well  growth.  This 
may  indicate  that  the  very  thin  6  ML  AlAs  layer  was  fairly 
smooth,  and  so  growth  interruption  was  not  beneficial. 

In  contrast  to  the  behavior  of  the  AlAs  layer,  specular 
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Fif  i  1  Lavtr  schcmjMc  of  the  DBRTD  >truciurc  along  with  inter^^upt 
>chcdulc  for  the  foiir  DBRTDs  studitfd 

sptn  intensity  recovery  was  observed  for  interrupt'  on  the 
18  ML  thick  GaAs  layer  If  no  interruption  was  used  after 
the  GaAs  growth,  the  RHEED  oscillations  seen  during  the 
growth  of  the  final  AlAs  layer  were  much  worse  than  those 
seen  during  the  grow  th  of  the  first  AlAs  layer  However,  a 
60  s  interruption  on  the  GaAs  layer  was  sufiicient  to  pro¬ 
duce  oscillations  from  the  final  AlAs  layer  with  amplitude 
comparable  to  that  of  the  first  AlAs  layer.  Since  the  inter¬ 
rupt  schedule  affects  the  interface  quality,  it  is  possible  it 
may  affect  the  device  /-L  characteristics.  From  the  data 
shown  in  Fig  2.  u  might  be  expected  that  a  DBRTD  de¬ 
vice  grown  using  interruptions  only  after  the  growth  of 


FKi  2  E'oluiion  of  RHEED  specular  spot  inlen.sily  oscillations  during 
prnloispical  growth  sequences  consisting  of  h  ML  AlAs/inlerrupt/lH  ML 
GaAs.  mterrupt/6  ML  AlAs  The  interrupt  time  was  varied  from  0  to  60 
s  1  he  interrupt  time  scjIcn  hjsc  been  compressed  to  allow  easy  compar¬ 
ison  of  the  relative  oscillation  amplitudes  of  the  three  layers. 


Eu,  V  Summary  ot  measured  f-L  data  for  samples  A.  B.  L  .  and  D  The 
values  lor  7  .  pvr  R  f  ^  jd^i  tiomiah/ed  tir  their  respective  as 

erages  based  on  alt  samples  and  both  bias  direeliiuis  The  average  values 
lor  J„  PVC'R.  I  .  and  d  I  were  4  '  kA  em  ,  4  4.  0  g.l  V.  am)  I)  22  V. 
respeviivelv  the  standard  devcitiotis  i'ln  all  ijuanlilics  presented  were  less 
than  5G 

GaAs  layers,  and  not  after  AlAs  barrier  layers,  should 
improve  the  /-F  characteristics  over  a  device  grown  with 
no  interruptions.  Further,  a  device  grown  with  long  inter¬ 
ruptions  (60  s)  after  Al.As  layers,  hut  not  after  the  GaAs 
layers,  might  result  in  poorer  characteristics  compared  to 
one  grown  with  no  interruptions. 

Measured  /-F  data  for  samples  A,  B,  C.  and  D  are 
summarized  in  Fig.  .L  which  shows  normalized  values  for 
peak  current  density  (JJ.  pcak-to-valley  current  ratio 
(PVCR).  peak  voltage  '  i'^).  and  difference  between  peak 
and  valley  voltage  (liF)  for  b<ilh  electron  injection 
through  the  bottom  barrier  first  (forward  bias)  and  for 
injection  through  the  top  barrier  first  (reverse  bias)  The 
quantities  are  scaled  to  their  respective  averages  based  on 
all  samples  and  both  bias  directions.  The  average  values  for 
4.  PVCR.  F^  and  AF  are  43  kA/cm'.  4.4,  0.83  V,  and 
0.22  V.  resfiectively. 

For  both  bias  directions,  and  PVCR  are  independent 
of  both  interrupt  time  and  schedule.  The  F„and  A  F  appear 
to  be  .sample  dependent  in  that  a  higher  F,,  and  smaller  AF 
is  obtained  in  sample  D  (60  s  growth  interruption  on  AlAs 
only).  Although  this  may  be  due  to  the  interrupt  schedule, 
it  could  also  be  due  to  parasitic  series  resistance.  In  fact,  a 
similar  structure  grown  during  a  previous  system  cycle 
c.xhibited  characteristics  similar  to  samples  A.  B,  and  C. 
Surprisingly,  the  60  s  growth  interruption  on  GaAs  did  not 
improve  the  PV'CR  and  even  though  the  RHEED  mea¬ 
surements  indicated  that  tills  interrupt  did  cause  significant 
interface  smoothing.  Growth  interruption  on  the  AlAs 
seems  to  have  had  little  impact  on  /-F  characteristics. 
These  observations  indicate  that  any  interface  roughness 
present  in  our  thin  barrier,  high  current  density  AlAs/ 
GaAs  DBRTDs  does  not  play  a  significant  role  in  deter¬ 
mining  perpendicular  transport  characteristics. 

All  the  DBRTD  samples  studied  here,  regardless  of  in- 
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terrupt  schedule,  display  a  slight  asymmetry  in  both  PVCR 
and  Jp  between  forward  and  reverse  bias.  Electron  injection 
from  top  to  bottom  (i.e.,  reverse  bias)  results  in  higher 
PVCR  and  lower  J^,  whereas  for  electron  injection  from 
bottom  to  top  (forward  bias),  the  PVCR  is  lower  and  the 
Jp  is  higher.  If  the  PVCR  is  taken  to  be  a  measure  of  the 
interface  roughness^ '  then  our  data  suggests  that  the  role 
of  the  inverted  (GaAs  on  AlAs)  and  normal  (AlAs  on 
GaAs)  interfaces  in  our  context  is  not  very  significant.  In 
fact,  all  of  the  samples  studied  here  exhibit  a  higher  PVCR 
for  electron  transport  from  top  to  bottom,  where  electrc.is 
encounter  a  nominally  rougher  AlAs  interface  first,  rather 
than  bottom  to  top,  where  electrons  encounter  a  nominally 
smoother  AlAs  interface  first.  For  our  devices,  we  believe 
the  observed  asymmetry  is  due  to  a  growth-induced  thick¬ 
ness  asymmetry,  which  yields  a  top  AlAs  barrier  that  is 
slightly  thicker  than  the  bottom  AlAs  barrier.'' 

IV.  CONCLUSION 

We  have  examined  the  influence  of  growth  interruption 
on  the  electrical  characteristics  of  thin  barrier,  high-cur¬ 
rent  density  AlAs/GaAs  DBRTDs  grown  by  MBE.  Inter¬ 
rupt  schedules  were  determined  by  independent  RHEED 
measurements  of  the  intensity  oscillations  during  prototyp¬ 
ical  device  growth  sequences.  Four  nominally  symmetric 
DBRTDs  with  different  interrupt  sequences  were  fabri¬ 
cated  and  electrically  characterized.  Our  data  suggests  that 


interface  roughness  at  inverted  and  normal  interfaces  does 
not  play  a  significant  role  in  determining  the  perpendicular 
transport  characteristics  of  high  current  density  AlAs/ 
GaAs  DBRTDs. 
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A  set  of  quantum  welb  containing  wells  of  integer  n  and  integer  plus  half-monolayer  n  4  1/2 
thickness  were  grown.  The  photoluminescence  of  these  samples  contained  distinct  peaks  far  both 
the  n  and  «  4  1/2  wells.  TTie  structure  was  regrown  with  growth  interruption  between  layers. 

With  sufficiently  long  interrupt  times  (  ~60  s)  the  peaks  due  to  the  a  4  1/2  wells  disappeared 
leaving  only  peaks  from  wells  of  integer  monolayer  thickness.  This  result  can  be  interpreted  in 
terms  of  structural  changes  at  the  interfaces  such  as  island  coalescence.  The  intercity  of  the 
peaks  and  shifts  in  their  energy  also  provide  a  measure  of  contamination  and  microroughness. 


I.  INTRODUCTION 

Photoluminescence  (PL)  is  an  extremely  powerful  tool  for 
the  analysis  of  GaAs/AlGaAs  quantum  wells  (QWs)  Due 
to  the  confinement  of  carriers  the  PL  transition  energy  is 
shifted  to  higher  energies  as  the  well  thickness  is 
decreased.'  The  linewidth  of  the  PL  transition  from  the 
well  is  sensitive  to  the  structure  of  the  interfaces  between 
the  layers^"*  and  the  intensity  of  the  transition  is  related  to 
the  material  quality,'  *  This  sensitivity  to  layer  structure 
and  quality  can  be  used  to  study  the  growth  processes 
which  occur  dunng  molecular  beam  epitaxy  (MBE) 

A  common  growth  technique  is  to  intcTUpt  growth  be¬ 
tween  layers  during  the  growth  of  QWs.  Migration  of  the 
adatoms  on  the  surface  changes  the  size  and  oistribution  of 
islands  present  and  is  reflected  in  the  PL  transition  energy 
and  linewidth.  Often  the  growth  of  a  single  QW  with  in¬ 
terruption  will  give  rise  to  more  than  one  peak  due  to  the 
formation  of  wells  of  different  thickness  distributed  later¬ 
ally  across  the  sample.  These  multiple  peaks  can  be  inter¬ 
preted  m  terms  of  a  simple  model  in  which  the  exciton 
samples  the  potential  in  its  local  environment.*  ^  This  be¬ 
havior  has  been  related  to  the  recovery  of  the  reflection 
high-energy  electron  diffraction  (RHEED)  intensity  upon 
growth  interruption,**' 

II.  EXPERIMENT 

In  this  article  we  propose  an  additional  technique  in¬ 
volving  growth  interruption  and  PL  of  QWs.  Shown  in  Fig. 
1  is  the  PL  spectrum  of  a  set  of  GaAs  QWs  grown  with 
wells  of  integer  n  monolayer  thickness.  In  this  sample  wells 
of  4,  5.  6,  7.  8.  9,  10,  11,  13,  15.  19,  25.  and  35  ML  were 
grown  separated  by  300  A  AIo  ^GsotAs  barriers.  The  tran¬ 
sitions  from  this  sample  give  a  series  of  sharp  and  well 
separated  peaks,  one  for  each  well  thickness.  This  sug¬ 
gested  the  idea  of  growing  wells  of  nominal  thickness  equal 
to  n  +  i/2  ML.  A  sample  was  grown  in  which  the  shutters 
were  t.med  to  give  nominal  well  thicknesses  of  3,  3.5,  4, 
4.5,  5.  5.5,  6.  6.5,  7,  7.5,  8,  8.5.  9,  9.5,  10.  10.5,  and  1 1  ML 
separated  by  300  A  barriers.  The  PL  of  this  sample  is 
shown  in  Fig.  2  where  17  distinct  peaks  can  be  seen,  again 
one  for  each  of  the  well  thicknesses  grown.  This  same 


structure  was  also  grown  with  growth  interruptions  of  10, 
30,  60,  and  1 20  s  between  all  layers.  The  PL  of  the  samples 
grown  with  10  and  30  s  interrupts  arc  shown  in  Fig  3  In 
this  figure  ihe  peaks  arc  beginning  to  blur  together  such 
that  the  number  of  distinct  peaks  is  decreased  This  trend 
is  continued  for  the  case  of  60  s  interrupts  shown  as  the 
upper  spectrum  of  Fig.  4  In  this  spectrum  the  half-mono¬ 
layer  transitions  have  disappeared  leaving  nine  relatively 
distinct  transition  which  corresponds  to  the  number  of  in¬ 
teger  monolayer  wells  grown.  The  spectrum  for  interrupts 
of  120  s  is  shown  as  the  bottom  spectrum  in  Fig  4  where 
again  di.siinct  transitions  are  seen,  but  surprisingly  there 
arc  even  fewer  peaks  present, 

All  samples  were  grown  in  a  Varian  Gen  if  at  620  *C 
with  an  As  cracking  source.'®  The  Ga  growth  rate  was  set 
to  0.7  ML/s  and  the  Ai  growth  rate  was  set  to  0  3  ML/s 
The  A.s/Ga  incorporation  ratio  was  kept  ai  approximately 
14  All  PL  spectra  were  taken  at  4  K  with  the  samples 
immersed  in  liquid  He.  The  samples  were  excited  with  the 

514.5  nm  line  of  an  Ar  ion  la.scr  at  a  power  density  of  100 
mW/cm^  The  luminescence  was  passed  through  a  SPEX  1 
m,  single-grating  spectrometer  and  detected  with  an  RCA 
C31034  GaAs  photocathode  detector. 

III.  DISCUSSION 

The  most  prominent  trend  to  be  seen  in  Figs.  2-4  is  the 
decrease  in  the  number  of  well  transitions  as  Ihe  interrupt 
time  increases  Without  interruption  17  peaks  are  distin¬ 
guishable,  and  this  reduces  to  nine  peaks  after  60  s  inter¬ 
rupts.  This  is  apparently  due  to  a  coalescence  of  the  nom¬ 
inal  half-monolayer  wells  into  wells  of  monolayer 
thickness.  Thus,  for  example,  one  can  1  magi  4  the  nominal 

3.5  ML  vyell  a$  possessing  closely  spaced  regions  of  3  and 
4  ML  thicknesses.  This  results  in  a  transition  energy  inter¬ 
mediate  between  that  of  a  3  and  a  4  ML  well,  which  dis¬ 
appears  as  the  regions  of  3  and  4  ML  become  larger  than 
the  exciton.  A  potential  problem  for  this  explanation  is  the 
apparent  absence  of  some  peaks  from  the  sample  grown 
with  120  s  interrupts.  However,  we  believe  this  is  due  to 
the  degradation  of  the  .sample  from  the  long  interrupt  time 

Degradation  with  interrupt  time  is  indicated  by  the  in- 
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Wavelength  (nnti) 


Fig.  1.  PL  spectrum  of  QW  sample  contamini  U  wells.  4.  5,  6,  7.  8.  9.  10. 
1 1,  13,  15.  19.  23.  and  33  .ML  thick,  separated  by  300  A  Al„  ,Ga,,As 
barriers. 


tegrated  intensities  from  this  scries  of  samples  which  was 
found  to  monotonicaily  decrease  with  increasing  interrupt 
time  (the  integrated  intensity  was  down  a  factor  of  2  for 
the  120  s  interrupt  sample).  We  have  previously  found 
degradation  in  the  PL  intensity  and  linewidth  of  QW  sam¬ 
ples  grown  at  diflferent  As  cracking  temperatures,  which 
we  attributed  to  the  generation  of  small  amounts  of  AsO.* 
Even  when  the  AsO  was  reduced  below  the  detection  limits 
of  the  residual  gas  analyzer  after  an  extensive  bake,  we  still 
found  some  degradation  in  the  PL  indicating  the  contam¬ 
inant  had  not  been  completely  removed.  Since  contami¬ 
nants  may  be  expected  to  build  up  on  the  surface  during 
growth  interruption,  it  is  not  surprising  that  PL  degrada¬ 
tion  is  seen  in  the  present  work.  Our  earlier  work  also 
indicated  that  the  larger  wells  grown  near  the  substrate 
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Fig.  2  PL  spectrum  of  QW  sample  containing  integer  and  half-mono¬ 
layer  wells,  of  nominal  Ihicknesses  3,  3.5,  4,  4.5.  5.  5,5.  6,  6.3.  7,  7.5,  8, 
8.5,  9, 9.5, 10,  10,5,  and  1 1  ML.  separated  by  300  A  AI„  ,Gao-A6  harriers. 
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Fig  3  PL  spectrum  of  QW'  sample  coniatntng  int^r  and  haif-mono- 
layer  wells  grown  with  mierrupi  times  of  (a)  10  s  and  lb)  30  % 

seemed  to  suffer  a  proportionally  higher  loss  in  peak  inten¬ 
sity  than  the  thinner  wells  near  the  surface,  similar  to  the 
effect  seen  in  Fig.  4. 

There  is  clearly  not  a  distinct  change  of  half-tmmolayer 
wells  into  regions  of  atomically  flat,  int^er-monolaycr 
wells.  Instead,  we  observe  a  shift  in  the  transition  energies 
of  the  nominally  integer-monolayer  wells.  This  is  shown  in 
Fig.  5  where  the  transition  energy  for  the  3  and  4  ML  wells 
are  plotted  as  a  function  of  interrupt  time.  In  Fig.  5  one 
can  see  a  relatively  large  shift  in  the  transition  wavelengths 
to  longer  wavelengths  as  the  growth  interruption  time  in¬ 
creases,  although  it  does  seem  to  approach  a  constant  value 
for  longer  interrupt  times.  Shown  for  comparison  in  Fig.  5 
is  the  calculated  transition  wavelengths  for  well  thicknesses 


Flo.  4  PL  spectrum  of  QW  Mm|4e  contnning  tntc|^  Mid  haif-maiKv 
layer  wells  grown  with  mtemipi  times  of  (s)  60  s  snd  fb)  120  s. 
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GtowUi  Intemipiion  Time  (seci 


Fig.  3.  Measured  PL  transition  wavelengths  for  wells  of  nominal  }  and  4 
ML  thiclcness  with  increasing  interrupt  time  Also  shown  for  companson 
are  the  calculated  transition  wavelengths  (dashed  lines) 


of  2,  3,  and  4  ML  using  an  effective-mass  model  "  These 
results  can  be  interpreted  in  terms  of  the  "microrough- 
ness”  model.'*  In  this  model  the  interface  between  the  lay¬ 
ers  is  nearly  abrupt  on  an  atomic  scale  with  small  regions 
(much  less  than  an  exciton  diameter  in  size)  deviating  in 
thickness  by  one  or  more  monolayers.  The  exciton  sees  an 
average  potential  and  gives  rise  to  a  sharp  transition  but 
shifted  in  energy  from  one  with  perfectly  abrupt  interfaces 
The  shift  in  the  PL  transition  wavelength  would  mtrror  the 
relaxation  of  the  microroughness.  If  this  interpretation  is 
correct  it  would  also  imply  that  the  samples  grown  without 
interruption  have  significant  microroughness.  It  may  be 
fruitful  to  examine  the  magnitude  of  the  peak  shift  m  con¬ 
junction  with  a  simulation  of  kinetic  processes  occurring 
on  the  surface  such  as  that  of  Ogale  el  al? 


IV.  CONCLUSION 

A  novel  method  involv  ing  the  growth  of  quantum  wells 
having  nominal  integer  plus  one-half  monolayer  thickness, 
in  conjunction  with  growth  interruption  him  been  used  to 
examine  MBE  growth  processes.  Our  results  indicate  that 
approximately  60  s  is  sufficient  to  smooth  the  interfaces 
under  these  growth  conditions.  The  continuous  shift  in  the 
transition  energies  with  interrupt  time  indicates  the  pres¬ 
ence  of  microroughness.  However,  degradation  of  the  sam¬ 
ples  with  increasing  interrupt  time  illustrates  the  impor¬ 
tance  of  the  vacuum  ambient  and  reducing  contaminants  in 
the  system. 
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operation  on  asymmetrical  Y  laaen.  Wt  obtain  up  to  32  dB 
sidemode  suppression.  The  tuning  response  of  a  device  with 
df.  ••  S6ftm  is  shown  in  Fig.  4  together  with  the  iheoreticel 
tuning  scheme  curve.  With  one  sin^  current  the  MZ  Alter 
is  tuned  across  closely  spaced  FP  cavity  modes  within  a 
5  3  nm  wavelength  range  which  corrctpondi  to  the  MZ  Alter 
width.  The  correlation  to  our  simple  theory  is  Ane  at  low 
injection  currents.  At  higher  injection  we  find  deviation  due  to 
our  assumptions,  taking  no  gain  saturation,  carrier  healing 
and  interaction  of  the  four  carrier  reservoirs  into  account. 

In  a  further  operation  mode  (not  shown  in  the  Figure), 
again  with  one  single  current  I,,  the  gam  curve  is  shiAed 
across  the  MZ  modes  of  the  asymmetrical  device.  As  an 
example,  for  a  Y  laaer  with  df.  •  81  ttm  (corresponding  to  an 
MZ  spacing  of  4  nm)  an  8 nm  shift  across  three  MZ  modes  is 
accomplished  in  a  first  experimental  demons  (ration. 

Coneluiions:  To  overcome  the  unfavourable  tuning  control  of 
the  symmetrical  Y  laser,  we  have  designed  and  realised  the 
first  asymmetrical  Y-laaer  structures.  For  ihesc  devices  the 
expected  much  simpler  tuning  scheme  is  confirmed.  Further 
extensibility  of  the  actually  measured  tingle  current  tuning 
spans  is  expected  for  optimised  devioea  and  appropriately 
designed  MZ  filler  functions.  The  asynunetrical  Y-lsser  struc¬ 
tures  offer  easily  controllable  wavelength  addressing  schemes. 
Therefore,  the  successful  implementation  of  the  enormous 
potential  of  Y  lasers  into  photonic  switching  and  WDM  irani- 
mission  systems  is  expect^ 
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Nowitzki  for  excellent  technical  assisianoe  This  work  is  par¬ 
tially  financially  supported  by  the  German  Minister  for 
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HIGH  EFFICIENCY  MICROWAVE  DIODE 
OSCILLATORS 

S.  Jgvalagi,  V.  Reddy,  K.  Cullapsili  and  O.  Neikirk 

Indexuui  terms  Osctlleters.  Mk-rowave  oscillofori,  Mtero^toot 
drcicr.t.  Oioctrr 


Dais  on  the  operation  of  vei>  high  etTiaeocy  mterowave 
oaallators  using  AlAa/lnCsAa  quantum  wet)  iruaction  uansii 
(QWlTn  diodes  art  ptcacmad  A  DC-io-RF  porrec  conver¬ 
sion  dTiaency  as  higli  as  50W  was  achievtd,  which,  lo  the 
auihort'  Imowledgc,  ia  the  highest  dficiency  reponed  for  con¬ 
tinuous  wave  |CW)  operation  of  a  two  tominal  acmicon- 
ducior  devKc. 


IniroducHon:  Bsraiuae  the  high  frequency  output  power  of  sn 
individual  semicondueior  devtos  ia  timited.  there  has  been 
increasing  inierest  in  new  power  comtMning  lechniquca  for 
microwave  to  submiliimetre  wave  ayatema.  For  inatanot.  targe 
output  powers  have  been  general^  using  two  dimensional 
grid  array  sourers  [I],  Although  total  RF  output  power  may 
be  increased,  OC-to-RF  power  converaion  efficiency  rtmsina 
an  intrinsic  property  of  ibe  semiconduaor  device.  For  two- 
terminal  diodes,  the  highest  DC-to-RF  converaion  clficienciea 
have  been  obtained  from  IMPATTt,  which  in  pulsed  oper- 
aiion  have  achieved  60V.  [2].  In  CW  operation,  onty  43V.  has 
been  achieved  [3].  Wc  present  DC  and  RF  data  for  (wo  AlAi, 
InCaAi  quantum  well  injection  iransii  (QWITT)  diodes  with 
1200  and  1800 A  depletion  region  lengths.  DC-to-RF  power 
converaion  efficiencies  in  (he  range  44-50%  were  obtained 
under  optimum  RF  dreuit/devioe  matching  conditions  In 
addition,  (he  ISOOA  device  produced  a  single  device  output 
power  of  20  mW  without  requiring  any  device  hettsinking  or 
aubsirale  thinning. 

Experimeni:  The  two  AlAa/InGtAs  QWITT  diodes  with  drift 
region  lengths  of  1800  and  1 200  A.  referred  to  u  QWITT  I 
and  QWITT  2.  respectively,  were  grown  on  n  +  InP  sob- 
siratca  by  moieculai  beam  epitaxy  The  motivation  for  the 
QWITT  diode  is  that  (he  voltage  and  current  differences 
between  the  peak  and  the  valley,  and  d/^,  respectively, 
must  be  made  as  large  as  pouiblc  lo  increase  output  power 
[4,  3],  The  quantum  well  ia  formed  by  sandwiching  a  47  A 
In,.)]Gap  tvAs  layer  between  23  A  strained  AlAs  layen.  The 
quantum  well  ia  tetween  a  symmetric  layer  profile  consisting 
of  15  A  of  undoped  In,  ,,Giio  .vAv  sdiaccni  to  the  well  fol¬ 
lowed  by  ^lOOA  n  (2  3*  lO’^cin'T  In,  j)C.lo  .7AS  and 
100 A  n  (I  3  *  I0”cm‘*)  lnp.,,Cu„ .^Ai  The  drift  region  is 
placed  below  the  quantum  well.  Standard  processing  teeb- 
niques  were  used  lo  febricaie  mesa  isoltied  diodes  with  Au 
topside  contacts.  The  room  temperature  peak  current  density 
wu  -  8  kA/em*  and  the  peak  to  valley  current  ratio  was  ~  20. 
The  itabtlirnd  (non-oiciilating)  current  density  against  voltage 
characterisiici  of  lyp^  diodes  are  shown  in  Fig.  I. 

Two  open  circuit  single  stub  resonator  circuits  were 
designed  using  300  microttrip  lines  for  oscillation  at  1  and 
2'7CHz.  The  design  is  based  on  the  Kurokawa  oscillaiion 
criterion  (i.e.  [6].  Both  small  signal  and 

large  signal  calculations  on  the  <^W1TT  diodes  show  a  very 
nearly  real  impedance  for  low  microwave  frequencies 
I  c  3GHz).  The  open  circuit  stub  in  the  circuit  produces  a  real 
impedance  at  the  device  only  over  a  narrow  band  of  fre¬ 
quencies;  thus  the  Kurokawa  condition  can  be  satisfied  only 
in  this  frequency  band.  For  the  stub  lengths  ctioaen  (he  circuit 
impedance  should  be  purely  real  at  -  I  and  2-7  GHz  for  cir¬ 
cuits  I  and  2,  reapectivtiy-  Because  the  diode  impedance  is 
nearly  real,  the  frequency  of  oadtlation  is  expected  lo  be  inde¬ 
pendent  of  the  diode  area  or  atructurc  (i.e.  QWITT  I  or 
QWITT  2).  The  diodes  were  biased  with  an  HP116I2A  bias 
tee  and  the  DC  port  was  monitored  for  bias  Hnc  oscillations. 

Results  and  discussion;  The  diode  mesas  were  contacted  by  a 
tungsten  whisker  at  the  end  of  the  circuit  and  different  areas 
on  each  sample  were  probed.  The  RF  output  power  was  mea¬ 
sured  by  placing  each  sample  at  the  end  of  the  circuit  and 
varying  the  DC  bias  voltage  until  maximum  output  power 
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was  obtuincvi  1  !k  iart;c  spccilii.  ncgausc  resistance  of 

the  deuce  Is  a  str»':ig  fuiKtion  of  the  Kf  arnpiitude,  hence  for 
a  fixed  RF  circuit  the  power  density  should  vary  with  device 


Fig.  1  Typical  slahihirJ  (nunisci/laimgl  DC  J-V  cunts  for  QWITT  I 
andQWITT! 

□  QWITT  1 

O  QWITT  2 

area.  The  frequency  spectrum  of  the  QWITT  oscillators  was 
(airly  narrow,  producing  a  typical  spectrum  which  was  ISdB 
down  3kHz  off  the  centre  frequency  for  a  resolution  band* 
width  of  1  kHz.  The  linewidth  was  limited  by  the  very  tow  Q 
circuit  used  here.  Particular  care  was  taken  to  ensure  the 
accuracy  of  the  RF  power  measurements  through  the  use  of  a 
calibrated  RF  power  meter.  The  values  reported  here  should 
be  conservative,  because  the  only  measurement  error  cor¬ 
rected  for  was  a  IdB  cabling  loss;  no  compensation  for  the 
loss  due  to  the  microsirip  circuit,  connectors,  and  bias  tee  was 
made. 

The  experimental  results  for  both  QWITT  1  and  QWITT  2 
operating  in  the  two  circuits  are  summarised  in  Table  1. 
Values  quoted  are  averages  for  four  to  five  nominally  identical 
diodes;  scatter  was  ^3%  of  the  average  value.  For  some 
devices,  a  DC-to-RF  power  conversion  elTiciency  as  high  as 
30%  was  achieved  at  a  power  density  of  4kW/cm*  for  the 
QWITT  I  structure;  the  average  result  for  several  diodes  was 
43%.  To  our  knowledge,  this  is  the  highest  efTiciency  ever 
reported  for  continuous  wave  operation  with  a  two  terminal 
device.  Conventional  IMPATT  diodes  have  been  shown  to 
have  efliciencies  of  60%,  but  only  in  pulsed  operation  [2],  For 
CW  operation,  IMPATTs  have  demonstrated  cfTiciencies  only 
u  high  u  43%,  but  more  typically  in  the  range  16-36%  [3, 
7].  Gunn  diodes  have  DC-lo-RF  conversion  efTtciencies  of  less 
than  13%  [8,  9].  The  highest  elTiciency  reported  for  quantum 
well  diode  oscillators,  hitherto,  was  11%  at  a  frequency  of 
1  GHz  [10]. 

The  maximum  power  obtained  from  a  single  device  was 
20 mW,  which  was  achieved  without  any  heatsinking.  This  is 
the  hipest  power  reported  to  date  for  quantum  well  diqde 


osL'iIlaiofij,  .1  factor  of  20  greater  than  previojiiy  ispousd  ,r. 
the  literature  (980pW  at  I  GHz  [10])  Such  high  power  i> 
possible  because  thee  QWITT  diodes  have  very  high  specd.c 
negative  resistance,  allowing  the  use  of  very  large  area  diodes 
without  producing  bias  line  oscillations. 

Quasistatic  power  calculations  were  performed  by  numen- 
cally  varying  the  RF  voltage  swing  at  dilTerent  bias  points  m 
the  NOR  region,  using  the  stabilised  DC  J-V  curves  of  both 
diodes  [3],  We  find  that  both  the  ■•a*fmum  measured  and 
calculated  RF  power  obtained  arc  greater  than  the  frequently 
quoted  '3/16  AF  A/‘  rule  [10].  This  it  because  the  DC  I-V 
curve  is  not  cubic,  as  used  to  obtain  the  ‘3/16  AF  A/'  rule 
[11],  and  that  the  optimum  bias  point  is  not  the  average  of 
the  peak  and  valley  voltages,  but  rather  closer  in  value  to  the 
valley  voltage.  At  the  optimum  bias  and  RF  voltage  swing, 
simple  large  signal  calculations  on  QWITT  1  (based  on  the 
J-V  curve  shown  in  Fig.  I)  yield  a  power  density  of  4TkW',' 
cm'  with  a  DC-lo-RF  conversion  elTiciency  of  42%,  and 
QWITT  2  yields  3  OkW/cm'  with  40%  efTiciency. 

In  conclusion,  we  have  presented  data  on  AlAs/InCaAs 
QWITT  microwave  oscillators  that  have  produced  record 
DC-lo-RF'  power  conversion  elTiciencies  as  high  as  30%  and 
CW  output  powers  of  20  m W  from  a  single  device 
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Table  1  AVERAGE  RESULTS  FOR  POWER  DENSITY  AND  DC-TO-RF  CONVERSION  EFFICIENCY  USING  QWITT  I 


AND  QWITT  2  IN  CIRCUITS  1  AND  2 


Device 

Circuit  1 

Circuit  2 

Sample 

Area 

Frequency 

Power  density 

Efficiency  i) 

Frequency 

Power  density 

Efficiency  it 

i*m*i «  i0**> 

CHx 

kw/cm* 

X 

CHl 

kW,»m* 

X 

QWITT  I 

300 

0  941 

42 

3  6 

39 

W  -  ISOOA 

160 

0931 

2’83 

39 

2  313 

4  0 

45 

68 

0933 

2-40 

27 

2-448 

2  8 

36 

QWITT  2 

300 

0943 

2'60 

33 

2-238 

15 

20 

W  -  t200A 

144 

0930 

300 

37 

2  2 

26 

68 

0947 

2-40 

33 

2-357 

17 

24 

Results  averaged  over  four  to  five  devices:  maximum  efficieniy  as  high  at  was  obtained  (or  QWITT  t  m  circuit  2  for  a  nominal 
160  X  to* 'em’ area  device 
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VOLTAGE  CONTROLLED  OSCILLATORS 
USING  COMPLEMENTARY  CURRENT 
MIRROR  INVERTER 

R.  R.  Pal,  H.  N.  Acharya,  S.  Kai  and 
N.  B  Chakrabarti 


oscillator  wherein  the  output  of  an  odd  number  of  serially 
connected  CCMIs  is  fed  back  to  the  input  stage.  The  fre¬ 
quency  of  oscillation  depends  on  the  number  of  inverter 
stages,  the  load  capacitances  and  the  current  level. 

To  determine  the  DC  behaviour  of  the  inverter,  we  assume 
that  the  base  input  voltage  of  Qj  is  V  and  its  collector  voltage 
is  Kj.  U  y„  ^  0  5  V,  the  quiescent  current  is  very  small.  The 
input  and  output  currents  I,  and  /,.  as  shown  in  the  Figure, 
are  given  by 

I,  -  (/.,!  f,.i)  e*p  My  +  K,)/k-n 

-  +  1^*)  exp  WK,  -  yykTI  (I) 

exp  My„  -  yykT] 

~  lu,  exp  [q{y  +  V„)/kT]  (2) 


Injfxiny  frrnu  Inverlen,  Voltagt  conirotifd  oscilUiiors.  Ring 
oscillaiors.  Circuit  dtsign 

Complementiry  current  mirror  inverieri  hive  Urge  band¬ 
width.  imill  time  delay  but  small  gain  whereas  complemen¬ 
tary  inverters  have  high  gain  but  large  lime  delay.  Ring 
oscillators  have  been  realised  using  a  combination  of  both 
types  of  inverter  to  achieve  stable  high  frequency  oscillation. 
Voltage  controlled  oscillators  were  obtained  by  tuning  the 
frequency  of  oscillation  with  supply  voltage. 


lniri)Juc;ion:  Voltage  controlled  oscillators  (VCOs)  are  widely 
used  in  such  applications  as  phase  locked  loops,  timing 
recovery,  modulators  and  demodulators,  and  frequency  syn¬ 
thesisers.  Ring  oscillator  VCOs  have  the  potential  of  very  high 
frequency  operation  and  low  cost.  The  oscillation  frequency  of 
a  ring  oscillator  is  fixed  and  is  determined  by  the  propagation 
delay  encountered  by  the  number  of  inverter  stages  used  to 
realise  the  oscillator.  Silicon  bipolar  technology  using  merged 
npn-pnp  complementary  inverters  is  attractive  for  the  realis¬ 
ation  of  high  frequency  VCOs  because  of  high  inlegrability 
and  reduced  process  complexity..  Merged  npn-pnp  complemen¬ 
tary  inverters  with  current  mirrors  appear  to  be  particularly 
suited  for  VCO  applications  because  the  associated  circuits  for 
the  phase  frequency  detector  are  realisable  in  current  mirror 
logic  using  the  same  configuration  [I].  Merged  npn-pnp 
inverters  can  be  used  in  three  basic  forms:  (he  npn  driver,  pnp 
current  soucc  [2,  3],  the  complementary  inverter  (Cl),  and  the 
complementary  inverter  with  current  mirror  (CCMI).  This 
Letter  describes  the  implementation  of  a  VCO  using  a  CCMI 
ring  o.scillator. 


yCO  using  complementary  current  mirror  inverters  (CCMIs): 
The  VCO,  as  shown  in  the  inset  of  Fig.  I,  is  primarily  a  ring 


Fig.  I  yarialinn  nf  osrillaiinn  frequency  with  supply  vnllage  of  three- 
stage  and  five-stage  CCMI  ring  oscillator 

Inset:  cireuii  diagram  of  threc-itage  CCMI  ring  oscillator 
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where  is  the  saturation  current  with  the  collector  shorted 
to  the  base  and  1^  is  the  base  current  with  active  collector. 

ffi*  ‘nd  I  MS  Mt  (he  collector  currents  of  transistors  Q,, 
Qt  and  (2„  respectively,  and  is  the  base  current  of  Q«. 

Assuming  ^(f,|/,y)  • ^(fws f,i*  VC(^ji/fji)3  * 
exp  (C,)  and  s/[(/«j//„«)]  -  exp  (C,)  where  /„  -  (/.„ 
+  f.n)  and  l,i  “  (Ipti  +  we  obtain 

,  „  ,  sinh  q(F  +  C,)A'/ 

~  •  /„sinhq(F  +  C, )/*■/• 

This  implies  that  transistors  Qi  and  would  need  to  have  a 
larger  area  for  current  gain.  The  small  signal  AC  gain  of  an 
inverter  stage  consisting  of  a  complementary  inverter  (Cl)  fol¬ 
lowed  by  a  mirror  can  be  expressed  in  terms  of  (he  trans- 
conductance  and  admittances  of  the  two  stages.  The  small 
signal  DC  gain  is  given  by 

Co  -  (e«i  +  +  g,m  +  Vr.  +  Oe,)  (4) 

where  Qt,  and  g^,  the  output  conductances  of  the  inverter, 
and  are  the  conductances  of  the  mirror  transistors. 
The  above  expression  shows  that  the  DC  gain  can  be  con¬ 
trolled  by  varying  the  relative  sizes  of  the  inverters  and  mirror 
transistors. 

Results  and  discussion:  The  VCO  circuit  (shown  in  Fig.  I)  was 
wired  in  a  PCB.  The  transistors  used  are  BF  494B  (npn)  and 
lie  I57n  (ppn)  whose  f,  are  3tX)  .md  I  50  M Hr.  respectively. 
The  experimental  results  of  the  VCO  realised  using  three- 
stage  and  hve-stuge  CCMI  ring  oscillators  are  shown  in  Fig.  1. 
The  frequency  of  oscillation  increases  from  62  to  66  MHz  for  a 
supply  voltage  variation  of  30 mV  for  the  three  stage  oscillator 
whereas  a  variation  of  10 MHz  (35-45  MHz)  is  obtained  over 
the  same  voltage  variation  for  the  five-stage  CCMI  VCO. 
Variation  of  frequency  with  load  capacitance  (shown  by  the 
dotted  line  in  the  inset  of  Fig.  I)  for  a  three-stage  CCMI  VCO 
is  shown  in  Fig.  2.  The  VCO  performance  is  linear  and  suit¬ 
able  for  PLL  applications.  The  photograph  of  a  typical  VCO 
waveform  obtained  from  a  three-stage  CCMI  VCO  without 
load  capacitance  is  shown  in  the  inset  of  Fig.  2. 

It  is  interesting  to  study  the  variation  of  oscillator  frequency 
when  a  combination  of  the  complementary  current  mirror 
inverter  (CCMI)  and  complementary  inverter  (Cl)  is  used  (Fig. 

3  (inset)).  Fig.  3  shows  the  experimental  results  of  a  three-stage 
ring  oscillator  where  combination  of  a  CCMI  and  Cl  is  used. 
The  frequency  of  oscillation  reduces  in  proportion  to  the 
number  of  CCMI  stages  which  are  replaced  by  a  CL  This  is 
consistent  with  the  fact  that  CCMIs  have  large  bandwidth, 
small  time  delay  but  small  gain,  whereas  CIs  have  high  gain 
but  large  time  delay.  The  maximum  frequency  of  oscillation 
reduces  from  65  to  4  (X)MHz  when  all  of  the  three  CCMI 
stages  are  replaced  by  a  Cl  (Figs.  I  and  3).  Because  CCMI 
gain  and  time  delay  depend  on  the  relative  areas  of  Q,. 
and  Q,,  Q,,  (he  operating  frequency  can  be  changed  by 
placing  the  inverters  in  parallel,  either  at  the  mask  stage  or 
subsequently.  It  will  be  noticed  from  Fig.  that  the  frequency 
of  oscillation  initially  increases  with  supply  voltage  and  thus 
with  current.  However  when  the  current  exceeds  5  mA  per 
stage,  the  output  frequency  decreases.  This  may  be  attributed 
to  (he  increase  of  diffusion  capacitance. 
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Hg.  i  shows  the  wavelorms  in  the  NSIS  as  observed  by  a 
streak  camera.  The  input  control  pulse  was  broadened  from  9 
to  12ps  as  it  travers^  the  loop  due  to  the  interplay  of  the 
normal  dispersion  and  seU^phase  modulation,  and  walked 
throu^  the  signal  by  lOps  (Fig.  3a).  Fig.  ib  shows  transmit¬ 
ted  (switched)  and  reflected  (unswitched)  signals  for  a  CW 
prote  signal  input  with  a  wavelength  of  lS52nm.  The  peak 
level  of  the  switched  signal  was  about  60%  of  that  with  com¬ 
plete  switching  due  to  the  low  control  pulse  power  (420 mW 
peak  at  Port  63).  The  switched  signal  had  a  Ups  width.  The 
switching  contrast  was  about  17dB  mainly  due  to  the  poor 
extinction  ratio  of  the  polarisation  controller  in  the  loop. 
Total  insertion  loss  of  the  NSIS  for  the  signal  was  l2dB. 

Time-division  demultiplexing  was  then  demonstrated  using 
a  fixed  pattern  input  data  stream.  The  input  signal  with  a 
wavelength  of  lSS2nm  was  generated  from  a  gain-switched 
DFB-LD  driven  at  5-6  GHz.  The  resultant  pulses  with  30  ps 
pulse  width  were  then  compressed  to  8p$  using  a  1250  m 
dispersion-shifted  fibre.  The  compressed  pulses  were  injected 
into  a  fibre  optic  pulse  pattern  generator  consisting  of  two 
highly-birefringent  fibres  and  a  following  fibre  potariser.  The 
two  birefringent  fibres,  whose  polarisation  dispersion  was 
2ps/m,  were  about  10  and  20  m  in  length  and  connected  with 
their  polarisation  axes  offset  by  45°.  The  two  fibres  delay  the 
input  pulses  by  about  20, 40  and  60  ps  to  produce  a  50'4Gbit/ 
s  *. . .  0111 1000001 1110  . . .’  pattern  data  stream  as  shown  in 
Fig.  3c.  The  resultant  data  stream  was  then  injected  into  the 
NSIS.  Fig.  id-e  show  the  demultiplexed  5-6Gbit/s  *...  II  ...’ 
signal  (Fig.  3d)  and  the  remaining  '. ..  OllOlOOOOOllOlO  ...' 
signal  (Fig.  3e).  The  PLC  was  seen  to  be  stable  even  though 
no  temperature  control  was  employed. 

We  have  reported  the  first  NSIS  that  is  practical.  It  is  based 
on  an  integrated  silica-based  planar  lightwave  circuit  which 
provides  wavelength  selecting  functions  with  the  low  insertion 
loss  (<4  3dB),  high  isolation  (>40dB),  and  high  environ¬ 
mental  stability.  The  NSIS  was  tested  and  demonstrated 
diode-pumped  time-division  demultiplexing  at  504  Gbit/s 
with  a  13  ps  switching  lime  and  a  5  6  GHz  repetition  rate. 
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WAVELET  ANALYSIS  OF 
ELECTROMAGNETIC  BACKSCATTER  DATA 

H.  Kim  and  H.  Ling 

Indexing  terms:  Electromagnetic  wooes.  Wavelets,  Signal  pro¬ 
cessing,  Scattering 

An  analysis  of  electromagnetic  backscaner  data  using  the 
wavelet  uansfoim  is  presented.  The  multiresolution  pioperty 
of  wavelets  enables  scattering  events  with  different  scales  in 
the  frequency  domain  to  be  simultaneously  resolved.  The 
backscaitered  signal  from  a  dispersive  plasma  cylinder  is 
studied  using  the  wavelet  analysis.  The  different  scattenng 
mechanisms  are  clearly  identifiable  in  the  time-frequency  rep¬ 
resentation. 


Introduction:  The  electromagnetic  energy  backscattered  from 
an  unknown  scatterer  can  provide  useful  information  for  iden¬ 
tifying  and  characterising  the  scatterer.  Recently,  the  running 
window  Fourier  transform  (RWFT)  (also  known  as  the  short 
time  Fourier  transform  [2.  3])  was  applied  to  the  back- 
scattered  frequency  response  from  an  open-ended  waveguide 
cavity  to  obtain  a  joint  time-frequency  representation  of  the 
original  signal  [1].  In  the  time-frequency  plane,  localised 
events  in  time  (scattering  centres),  localised  events  in  fre¬ 
quency  (resonances),  and  dispersive  scattering  mechanisms  can 
simultaneously  displayed.  Consequently,  a  good  insight 
was  gained  on  the  different  scattering  mechanisms  which, 
when  properly  interpreted,  can  be  correlated  with  the  scatterer 
features  for  target  identification  applications.  However,  there 
is  one  intrinsic  limitation  associated  with  the  RWFT.  The 
scattered  signal  in  the  frequency  domain  generally  comprises 
scattering  mechanisms  with  different  characteristic  scales.  For 
example,  high-Q  resonance  phenomena  are  small-scale  fre¬ 
quency  events  whereas  contributions  from  nondispersive  scat¬ 
tering  centres  extend  over  large  frequency  scales,  Because  the 
RWFT  uses  a  fixed  window  width,  it  cannot  resolve  multiple 
events  with  different  scales  simultaneously.  In  this  Letter,  a 
wavelet  analysis  of  backscatter  data  is  presented  as  an  alterna¬ 
tive  tool  to  the  RWFT.  The  wavelet  transform  has  attracted 
much  attention  in  the  signal  processing  community  recently 
[2-5],  In  contrast  to  the  RWIT*,  the  wavelet  transform  tech¬ 
nique  has  a  multiresolution  property  through  the  use  of  multi- 
scale  windows.  Therefore,  multiscale  events  of  frequency  can 
be  more  effectively  resolved  by  the  wavelet  transform.  As  an 
example,  a  wavelet  analysis  of  the  backscattered  frequency 
data  fiom  a  dispersive  plasma  cylinder  is  presented. 

Wavelet  analysis:  We  define  the  wavelet  transform  of  a  fre¬ 
quency  signa’  F[(o)  as 

W/t,n)  =  jF(ti))t*'’H(T(wn))dti>  (lu) 

H(ai)  is  usually  referred  to  as  the  'mother  wavelet'.  Note  that 
the  wavelet  transform  is  defined  in  the  frequency  domain  for 
our  application,  contrary  to  its  usual  definition  in  the  time 
domain.  Eqn.  1  can  be  interpreted  as  the  decomposition  of  the 
frequency  response  F{w)  into  a  family  of  shift^  and  dilated 
wavelets  H{x(to  ■  O)).  By  shifting  //(<u)  with  a  fixed  scale 
parameter  t,  the  r-scale  mechanisms  in  the  frequency  response 
F{o))  can  be  extracted  and  localised.  Alternatively,  by  dilating 
Hlto)  at  a  fixed  O,  all  of  the  multiscale  events  of  F((ti)  at  Q  can 
be  analysed  according  to  the  scale  parameter  t.  This  is  the  so 
called  'multiresolution'  property  of  the  wavelet  transform  and 
is  an  important  advantage  over  the  RWFT. 

By  properly  manipulating  eqn.  la,  the  wavelet  transform 
can  also  be  carried  out  on  the  time  domain  signal  /(r)  of  the 
origjnat  frequency  domain  response  F(ci>). 

W/T,  Q)  =  J /(t)t -  ''‘Mt/rje^  dt  {lb) 

h(t)  is  the  Fourier  transform  of  H(io)  and  must  satisfy  the 
'admissibility  condition',  i.e.  6(0)  =  0.  Because  eqn.  16  is  essen¬ 
tially  the  Fourier  transform  of  [/(i)6(r/T)],  it  is  the  preferred 
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nu"iericai  implementation  of  the  wavelet  transform  through 
the  use  of  the  FFT. 

Backscatter  from  plasma  cylinder:  The  backscatter  from  a 
plasma  cylinder  is  considered.  Our  interest  in  this  problem 
stems  from  the  study  of  exhaust  plume  plasma  from  electro¬ 
thermal  arqet  thrusters  [6].  The  circular  plasma  cylinder  has 
a  radius  of  1 1-94  cm.  A  cold  plasma  model  is  assumed  to 
model  the  cylinder  as  a  dispersive  dielectric  with  c,  »  [1 
~  (Z^)*].  /,  is  the  plasma  frequency  and  is  chosen  to  be 
10  GHz.  To  generate  the  backscatter  frequency  response,  the 
eigenfunction  solution  [7]  for  a  circular  dielectric  cylinder  is 
implemented  numerically.  A  complex  Be»el  function  routine 
[8]  is  used  in  geiterating  the  numerical  data.  The  simulated 
backscattered  response  from  the  plasma  cylinder  from  3  to 
17  GHz  is  shown  in  Fig.  1.  We  observe  that  the  frequency 
response  consists  of  two  regions;  the  flat  response  below  the 
plasma  frequency  and  the  oscillatory  pattern  above  the 
plasma  frequency.  By  examining  the  frequency  response  more 
closely  around  the  plasma  frequency,  we  also  observe  addi¬ 
tional  small-scale  features  which  are  due  to  resonance  and 
dispersion  phenomena.  In  this  instance,  the  frequency 
response  is  composed  of  multiscale  events  and  the  wavelet 
analysis  is  ideally  suited  to  analysing  these  multiscale  fre¬ 
quency  data. 


Fig  I  Baclacattered  freifuency  response  of  uniform  plasma  cylinder 
with /,  =  lOCHl  and  radius  It  94  cm 

The  wavelet  transform  was  implemented  based  on  eqn.  th 
using  the  FFT  routine.  The  function  lit)  was  chosen  to  be  a 
two-sided  Kaiser-Bessel  window  with  a  constant  Q  factor  of 
O' 3.  (The  Q  factor  is  defined  as  the  ratio  between  the  window 
width  and  the  window  centre.)  The  resulting  time-frequency 
plot  obtained  using  the  wavelet  transform  is  shown  in  Fig.  2. 
Also  plotted  along  the  two  axes  are  the  time-domain  and  the 
frequency-domain  responses.  Two  distinct  scattering  mecha¬ 
nisms  can  be  identified  in  the  wavelet  domain:  a  vertical  line 
due  to  the  exterior  specular  reflection  of  the  cylinder  and  a 
dispersion  curve  due  to  the  once-intemally  refte^ed  contribu¬ 
tion  from  the  energy  penetrating  into  the  cylinder.  Below  the 
plasma  frequency,  the  plasma  is  cutoff  to  wave  propagation 
and  the  backscattering  comes  solely  from  strong  exterior 
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Fig.  2  Time-frequency  wavelet  representation  of  backscatter  data  from 
uniform  plasma  cylinder 

Grey  scale  plots  of  intensity  art  in  dB  with  dynamic  range  of  4gdB 


reflection.  Above  the  plasma  frequency,  the  plasma  cylinder  is 
partially  penetrable  and  the  onoe-internally  reflected  contribu¬ 
tion  is  present.  The  once-internally  reflected  wave  traverses 
twice  the  diameter  of  the  cylinder  through  the  dispersive 
plasma.  Because  the  phase  velocity  of  the  wave  in  the  plasma 
is  proportional  to  the  travel  time,  the  curve  in  the  wavelet 
plane  in  fact  represents  the  dispersion  diagram  of  the  plasma. 
At  the  plasma  frequency,  the  phase  velocity  approach^  infin¬ 
ity.  Physically,  this  corresponds  to  the  electron  resonance 
around  the  heavy  ions  in  the  plasma.  As  the  frequency 
increases  beyond  the  plasma  frequency,  the  phase  velodty 
approaches  the  speed  of  light  in  vacuum.  In  between  the  two 
regions  the  plasma  is  highly  dispersive.  The  wavelet  transform 
successfully  resolves  the  different  events  with  widely  varying 
scales  in  the  frequency  domain.  It  is  worthwhile  to  point  out 
that  the  RWFT  would  not  be  able  to  simultaneously  resolve 
the  large-scale  (conesponding  to  the  initial  peaks  in  lime 
domain)  and  small-scale  events  (resonance  and  dispersion). 


Fig  3  Time-frequency  wavelet  representation  of  backscatter  dote  from 
two-layer  piastna  cylinder 

Inner  layer  has  radius  of  4-78 an  with/,  »  8 GHz  and  outer  layer 
is  of  thickness  7-  I6cm  with /,  «  SGHi 

Grey  scale  plots  of  intensity  are  in  dB  with  dynamic  range  of  S2dB 
G,  =  8  GHz.  r,  *  4  7gctn 
/,j  =*  5GHz,  r,  =  11  94 cm 

The  result  for  a  two-layer  plasma  cylinder  is  shown  m  Fig 
3.  The  inner  layer  has  a  radius  of  4-  78  cm  with 8  GHz  and 
the  outer  layer  is  of  thickness  7  !6cm  with  /,  »  SGHz.  We 
observe  three  regions  in  the  frequency  response  with  addi¬ 
tional  small-scale  features  around  the  plasma  frequencies. 
Again,  the  time-frequency  representation  using  wavelets 
allows  the  detailed  scattering  mechanisms  to  be  resolved. 
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CONOUCTOR'BACKED  TELLEQEN  SLAB  AS 
TWIST  POLARISER 


where  7,  ••  denotes  the  two-dimensional  unit 

dyadic  and  I  “  a,  x  T  w  the  90°  rotator.  K„  and 

are  the,  respe^ive,  refl^on  coenicients  corresponding  to 
(he  copoiarised  and  crosspolahsed  components  of  the  reflec¬ 
ted  field  when  the  incidrat  field  is  linearly  polarised.  The 
angles  and  d  are  real  for  a  lossless  slab,  ^  gives  the  phase 
shiA  and  ^  the  polarisation  rotation,  of  tte  reflected  wave 
with  respect  to  the  incoming  wave.  The  rotation  angle  ^  is 
positive  in  the  right-hand  direaion  when  looking  at  the  slab 
interface. 

The  dependence  of  and  ^  on  the  parameters  of  the  slab 
can  be  expressed  as 


I.  V.  Lindell,  S.  A.  Tretyakov  and  M.  I.  Oksaaen 


Indexing  terms:  Polarisation,  Electromagnetic  waves 

A  conductor-backed  slab  of  oooreciprocal  bi-isolropic 
material  with  nonzero  Tdlegen  parameter  is  shown  to  act  aa 
a  twist  polariser  (crosspolarising  reflector)  when  the  medium 
parameters  of  the  slab  satisfy  a  certain  condition,  whence  a 
linearly  poiariaed  incident  plane  wave  is  reflected  as  cross- 
polarised.  This  effect  is  seen  to  be  broadbtaded  for  large 
values  and  narrowbanded  for  low  values  of  the  Tellegea 
parameter  and  the  operation  is  independent  of  the  poiarisa- 
tioa  direction  of  the  incident  wave.  Analysis  of  the  Tellegen 
slab  is  outlined  and  the  two  operating  modes  for  the  twist 
polariser  ate  discussed. 


Introduction:  Novel  effects  and  devices  have  been  recently 
suggested  based  on  applications  of  chiral  media,  which  are 
reciprocal  special  cases  of  the  most  general  isotropic,  or  bi¬ 
isotropic  (Bl),  media.  The  chiral  parameter  k  and  the  Tellegen 
parameter  z,  which  is  responsible  for  nonreciprodty  of  the  BI 
medium,  appear  in  similar  positions  in  the  medium  equations 

tl] 

D  =  eE  +  (z  -  0) 

A- (z+  (2) 

For  lossless  BI  media,  the  four  parameters  e,  ft,  z<  x  ‘te  teal 
quantities.  In  this  Letter,  it  is  demonstrated  that  a  slab  of  BI 
medium  with  nonzero  Tellegen  parameter  z  ami  backed  by  a 
perfectly  conducting  plane  can  be  designed  to  rotate  the  inci¬ 
dent  linearly  polarised  field  in  reflection  by  90°.  This  type  of 
operation  is  basic  in  its  action  and  its  use  in  antenna  and 
microwave  devices  has  been  described  earlier  [2-3]  with  struc¬ 
tures  that  require  the  polarisation  of  the  incident  wave  to  be 
at  a  fixed  angle  to  an  axis  of  the  structure. 

An  isotropic  medium  with  a  nonredprocity  parameter, 
denoted  here  by  y,  was  first  suggested  by  Tellegen  in  1948  [6] 
to  produce  a  nonreciprocal  device  called  gyrator.  He  also 
explained  that  to  make  such  a  medium  in  practice  would 
require  a  microscopic  structure  of  similar  parallel  permanent 
electric  and  magnetic  dipoles.  This  possibility  has  also  been 
considered  by  others  [7.  8]  and  a  method  has  been  suggested 
which  ai^lies  parallel  electric  and  magnetic  fields  to  certain 
substances  such  as  CrO]  while  they  are  cooled  to  a  tem¬ 
perature  at  which  the  electric  and  magnetic  polarisations 
become  permanent  [9].  The  problem  of  fabricating  such  a 
medium  is  not  considered  here  in  any  more  detail. 

Reflection  dyadic:  Applying  vector  transmission-line  analysis*, 
where  any  layered  medium  with  homogeneous  BI  media  and 
boundaries  parallel  to  the  xy  plane  can  be  replaced  by  a  trans¬ 
mission  line  with  dyadic  characteristic  impedances  and  dyadic 
propagation  factors,  the  reflection  dyadic,  corresponding  to 
normal  incidence  of  plane  waves,  of  a  Bt  slab  of  ihickneu  d 
and  with  a  perfectly  conducting  backplane  can  be  written  in 
the  dyadic  form 

K  «•  e**icot  ^T,  -nin  «•  R„  T,-y  R„J  (3) 


tan 

tan 


fftio  cos  y  sin  (2kd  cos  v) 

(•I*  +  1o)  sm^fkd  cos  v)  -  (jJ  cos^v 

2y)o  sin  v  sin^fkd  cos  v) 

(ij*  -  ifl)  sin’flul  cos  v)  -)■  cos^v 


(4) 

(5) 


We  denote  i/o  yOfo/Co)  Mtd  quantities  of  the  BI  medium  by 
I)  -  kmnkaw  n  -  sin  v  - 

Z/"  Zr-  Z,  >s  called  the  relative  Tellegm  parameter.  It  is  seen 
that  the  chirality  parameter  k  does  not  affect  the  reflection 
dyadic  at  all  in  the  present  notation;  with  other  formalisms 
this  is  not  always  true  [I],  On  the  hand,  the  Tellegen  param¬ 
eter  z  has  an  essential  rdie  in  producing  polarisation  rotation. 
For  a  lossless  slab  with  real  eqn.  3  implies  the  relation 
|R„|^  »  1,  or  the  sum  of  co-  and  crosspolarised 

reflected  powers  equals  the  inddeni  power. 

The  nonreciprocal  character  of  a  medium  with  nonzero  Tei- 
legen  parameter  z  is  seen  most  easily  from  the  property  of 
having  nonzero  R^.  In  fact,  in  trying  to  reverse  the  reflection, 
it  is  seen  that  an  incident  wave  with  the  polarisation  of  the 
previously  reflected  wave  would  not  in  general  reflect  with  the 
polarisation  of  the  previously  incident  wave. 


Twist  polariser  Let  us  find  the  condition  under  which  the 
copolarised  refh  rion  coefficient  vanishes,  in  which  case  all  the 
power  incident  vitb  linear  polarisation  is  reflected  in  cross¬ 
polarisation.  From  eqn.  3  we  see  that  the  condition  0  is 
satisfied  when  cos  0  0,  i.e.  when  the  parameters  kd,  z  *>*<1 

if/iTo  Bfc  such  that  the  denominator  of  tan  ^  in  eqn  5  van¬ 
ishes. 


sin  (kd  cos  v)  - 


»ToC<»  >■ 


This  has  real  solutions  for  lossless  BI  media  when  the  inequal- 
rty  *1/70  S  i  rin  y  I  -  1 2, 1,  or  p,  s  I  z  I »  satisfied  with  ^  -  n/Po 
To  have  «•  0  and  #  0,  the  Tellegen  parameter  obvi¬ 
ously  must  not  be  aro.  The  crosspolarised  reftection  coeffi¬ 
cient  is  from  eqn.  3  simply  the  phase  factor  -  r*.  with  cos 
ijr  —  sin  V  »  itjz-  Thus,  the  conductor-backed  BI  slab 
with  parameters  satisfying  eqn.  6  acu  as  a  CTOsspoiarising 
reflector  (twist  polariser)  creating  a  rotation  of  90°  and  a 
phase  shiA  <lr  for  the  reftoed  field.  The  total  field  in  front  of 
the  slab  will  in  this  case  be  ellipticaliy  polarised  with  the  axial 
ratio  AR  «  tan  (p/2). 

The  twist  polariser  effect  is  depeixfent  on  the  Tellegen 
parameter,  which,  in  addition  to  the  previous  inequality  must 
satisfy  another  one,  lz,l  S  I.  for  the  energy  function  to  be 
positive  semidefinitet. 

There  are  two  interesting  modes  of  operation  for  the  cross- 
polarising  reflector:  wide-band  operation  which  damps  the 
copolarised  reflection  in  a  wide  frequency  band  and  narrow- 
band  operation  which  suggests  uw  in  filtering  narrow  bands 
of  signals. 

In  wideband  operation,  the  value  of  the  relative  Tellegen 
parameter  y,  should  be  close  to  unity.  In  theory,  for  z,  “  1 
there  is  an  infinite  band  for  which  |  R,,  |  is  smaller  than  a  given 
number  A.  but,  if  A  is  small,  the  thkdtness  d  becomes  large. 
Thus,  for  practical  use,  somewhat  lower  values  are  more  suit¬ 
able.  As  seen  in  Fig.  1,  for  tj/ijo  “•  O-*  «nd  z,  “  0.8S  we  have 


•  UNDCLL,  I.  V,  tuctyakov,  e  a,  and  oksancn.  m.  i.;  ‘Vector 

transmittioD-line  and  circuit  theory  for  bi-iaotropic  layered  itnic-  t  undgll,  i.  v.:  'Methods  for  efectitHnagnetic  field  aoalysii  fOdord 
tures*.  fubmitted  to  J.  Electro  Magnetic  Wave  Appl.  Clarendon  Press).  To  be  published 


140 


IEEE  microwave  AND  GL  IDED  WAVE  LETTERS.  VOL.  2.  NO  4.  APRIL  1992 


Wavelet  Analysis  of  Backscattering  Data 
from  an  Open-Ended  Waveguide  Cavity 

Hao  Ling,  Member,  IEEE,  and  Hyeongdong  Kim,  Student  Member,  IEEE 


Abstract — The  wavelet  analysis  technique  is  applied  to  an¬ 
alyze  the  electromagnetic  backscattering  data  from  an  open- 
ended  waveguide  cavity.  Compared  to  the  conventional  short- 
time  Fourier  transform,  the  wavelet  transform  results  in  a  better 
representation  the  scattering  features  in  the  time-frequency 
plane,  due  to  its  multiresolution  property. 

I.  Introduction 

HE  electromagnetic  energy  backscattered  from  an  un¬ 
known  target  can  provide  information  useful  for  classify¬ 
ing  and  identifying  the  target.  This  is  commonly  accomplished 
by  interpreting  the  radar  echo  in  either  the  time  or  the 
frequency  domain.  For  target  characteristics  which  are  not  im¬ 
mediately  apparent  in  either  the  time  or  the  frequency  domain, 
the  joint  time-frequency  representation  of  the  radar  echo  can 
often  provide  more  insight  into  the  scattering  mechanisms  and, 
when  properly  interpreted,  can  lead  to  successful  identification 
of  the  target.  In  a  recent  paper  by  Moghaddar  and  Walton  { 1 ), 
the  Joint  time-frequency  analysis  of  an  open-ended  waveguide 
cavity  was  carried  out.  In  that  work,  both  the  short-time 
Fourier  transform  (STFT)  and  the  Wigner-Ville  distribution 
were  applied  to  the  broad-band  backscattering  data  from  a 
dispersive  cavity  structure  to  arrive  at  the  time-frequency 
representation.  Good  insights  on  the  scattering  mechanisms 
were  gained  from  the  results.  However,  the  STFT  is  limited  by 
its  fixed  resolution  in  both  the  time  and  the  frequency  domain. 
The  Wigner-Ville  distribution,  although  providing  good  local¬ 
ization  of  scattering  mechanisms,  introduces  additional  cross 
terms  which  leads  to  “ghosts”  in  the  time-frequency  plane. 

The  theory  of  wavelets  is  currently  attracting  a  great  deal  of 
attention  in  many  disciplines  of  applied  science  (2H5j.  In  this 
letter,  the  wavelet  transform  is  applied  to  the  backscattering 
data  from  an  open-ended  waveguide  cavity  in  order  to  derive 
the  time-frequency  representation  of  the  signal.  Contrary  to 
the  conventional  STFT  that  has  fixed  resolution  in  both  time 
and  frequency,  the  wavelet  transform,  when  properly  defined, 
can  provide  variable  resolution  in  time  and  multiresolution  in 
frequency.  Since  the  early-time  radar  echo  from  finite  objects 
usually  consists  of  sharp  peaks  [6],  very  fine  time  resolution 
is  needed  to  resolve  the  various  scattering  centers.  On  the 
other  hand,  since  the  late-time  arrivals  are  characterized  by 
resonant  ringing,  good  frequency  resolution  (or  coarse  time 
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resolution)  is  needed  for  isolating  the  target  resonances.  In 
the  intermediate  region,  dispersive  phenomena  require  good 
resolution  in  both  time  and  frequency.  The  multiple  resolution 
property  of  the  wavelet  transform  is  ideally  suited  for  this  task. 
Consequently,  the  wavelet  representation  can  provide  a  better 
time-frequency  characterization  of  the  backscattering  data. 

II.  Short-Time  Fourier  Transform 
And  Wavelet  Transform 

The  conventional  STFT  of  a  time  signal  fit)  is  defined  as 
(31.  14); 

5(r.  ^)  =  J  (la) 

It  is  essentially  the  Fourier  transform  operation  with  the 
addition  of  a  time  window  function  g(t).  The  translation  of 
the  window  as  a  function  of  r  results  in  a  two-dimensional 
time-frequency  representation.  5(t.  fl),  of  the  original  time 
function.  By  manipulating  (la),  the  STFT  can  also  be  ex¬ 
pressed  in  the  frequency  domain: 

5(r.n)  =  c-J"'  J  F(uj)G(Q  -  (lb) 

Here  F(a))  and  G{ui)  are  the  Fourier  transforms  of  f{t)  and 
/;(/),  respectively.  We  observe  from  (la)  and  (lb)  that  the 
STFT  representation  can  be  obtained  through  either  a  moving 
window  in  time  g{t)  or  a  corresponding  moving  window  in 
frequency  G{uj). 

We  will  now  introduce  the  (continuous)  wavelet  transform 
of  a  time  signal  /(()  for  our  application: 

IF;(r,n)  =  j  f{t)T-^^^h{t/T)ej^^dt.  (2a) 

By  comparing  (la)  and  (2a),  we  see  that  h{t)  is  similar  to  the 
window  function  g(i.)  in  the  STFT.  However,  h{t.)  must  satisfy 
an  additional  “admissibility  condition”  in  wavelet  theory  (5), 
viz..  h(t  =  0)  =  0.  To  satisfy  this  condition,  h(t)  is  usually 
chosen  to  be  a  translated  window  function  with  its  center  at 
(o-  By  changing  t.  the  center  of  the  window  function  moves 
as  Tto  and  the  width  of  the  window  is  dilated  by  the  scale 
factor  r.  The  ratio  between  the  window  width  and  the  window 
center  (or  the  Q-factor  of  the  window  function)  remains  fixed 
for  all  time.  This  is  in  contrast  to  the  STFT  where  the  window 
width  is  fixed  for  all  time.  By  properly  manipulating  (2a), 
the  wavelet  transform  can  also  be  carried  out  on  the  Fourier 
transform  F{u))  of  the  original  time  signal: 

Wf{T,Q)  =  I  F(u;)t^/2^(t(u;  -  n))dw.  (2b) 
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Fig.  I  Geometry  ol  the  open-ended  circular  waveguide  cavity. 


H{^)  is  the  Fourier  transform  of  h{t)  and  is  usually  referred 
to  as  the  "mother  wavelet."  The  operation  in  (2b)  can  be 
interpreted  as  the  decomposition  of  the  frequency  signal  F(uj) 
into  a  family  of  shifted  and  dilated  wavelets  -  fl)).  It 

is  important  to  point  out  here  that  the  present  definition  of  the 
wavelet  transform  in  its  time  and  frequency  forms  is  exactly 
opposite  to  the  common  definition  of  the  wavelet  transform 
used  in  time-series  signal  analysis  (5). 


III.  Time-Freqiency  Representation 
Of  Backscattering  Data 

The  time-frequency  representation  of  the  backscattering  data 
from  an  open-ended  cavity  is  considered.  The  cavity  is  an 
open-ended  circular  waveguide  with  a  diameter  of  4.445  cm. 
A  flat  conducting  termination  exi.sis  60.96  cm  inside  the 
waveguide  (Fig.  1 ).  To  generate  the  backscattering  data,  the 
radar  cross  section  of  this  target  is  first  computed  in  the 
frequency  domain.  We  take  into  account  of  the  interior  cavity 
contribution  using  a  modal  approach  (7|  and  the  diffraction 
contribution  from  the  front  rim  of  the  cavity  using  the  asymp¬ 
totic  formula  in  (S).  It  has  been  previously  established  that  the 
backscattering  data  predicted  in  this  manner  agree  reasonably 
well  with  experiments.  The  time-domain  response  is  then 
obtained  by  Fourier  transforming  the  band-limited  frequency 
data  (from  2  to  18  GHz). 

Fig.  2(a)  show  s  the  time-frequency  plot  of  the  backscattering 
data  at  normal  incidence  {H  =  0°)  using  the  STFT.  In 
performing  the  STFT.  a  2-GHz  Kaiser-Bessel  window  in  the 
frequency  domain  is  used  in  equation  (lb).  Also  plotted  along 
the  two  axes  are  the  time-domain  and  the  frequency-domain 
responses,  h  is  apparent  that  the  scattering  features  are  much 
better  resolved  in  the  time-frequency  domain  than  in  either  the 
time  or  the  frequency  domain  alone.  Both  the  nondispersive 
rim  diffraction  and  the  two  mode  spectra  due  to  the  TEn 
(with  cutoff  at  3.96  GHz)  and  the  TEi:  (1 1.45  GHz)  mode  can 
be  clearly  identified.  The  mode  spectra  are  in  fact  dispersion 
curves  of  the  waveguide  modes  since  the  phase  velocity  of 
each  mode  is  proportional  to  the  travel  time.  The  noncau.sal 
noise  appearing  in  the  time-frequency  plot  is  caused  by  the 
modal  approximation  used  in  simulating  the  backscattering 
data.  If  actual  measurement  data  were  u.sed.  this  noise  should 
be  absent. 

Due  to  the  fixed  resolution  of  the  STFT.  the  scattering  fea¬ 
tures  in  Fig.  2(a)  are  smeared  out  in  the  time-frequency  plane. 
This  problem  is  overcome  by  using  the  wavelet  transform  that 
provides  a  much  better  representation  of  the  scattering  features 
in  the  time-frequency  plane,  as  shown  in  Fig.  2(b).  The  wavelet 
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Fig.  2-  Timc-frequcncy  representation  of  backscaliering  data  from  an 
open-ended  cavity  under  normal  incidence.  TTie  grayscale  plol.s  of  intensity  are 
in  decibels  with  a  dynamic  range  of  40  dB.  (a)  Short-time  Fourier  transform 
(STFT)  representation  (b)  Wavelet  transform  representation. 


transform  is  implemented  using  equation  (2a)  with  the  aid 
of  the  FFT.  The  function  h{t)  is  chosen  to  be  a  two-sided 
Kaiser-Bessel  window  with  a  Q-factor  of  0.3.  The  t  =  0 
reference  of  h{t)  is  located  midway  between  the  time  events 
from  the  rim  diffraction  and  interior  contribution  (at  r  =  2.05 
ns).  The  variable  time  resolution  of  the  wavelet  transform 
allows  sharper  time  resolution  to  be  achieved  during  the  early- 
lime  response  and  sharper  frequency  resolution  (coarser  time 
resolution)  to  be  achieved  during  the  late-time  response.  Thus, 
wavelet  transform  provides  good  resolution  in  identifying  the 
scattering  centers  and  resolving  the  resonant  phenomena  of 
the  target  while  adequately  describing  the  dispersive  scattering 
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Fig.  3.  Time-frequency  represeniaiion  of  backKattenng  data  from  an  open-ended  cavity  under  45'  incidence  The  grayscale 
plots  of  Intensity  are  in  decibels  with  a  dynamic  range  of  40  dB  la)  Shon  time  Fourier  transform  ISTFT)  representation  Ibi 
Wavelet  transform  representation 


mechanisms  in  the  intermediate-time  region.  Figs.  3(a)  and 
3(b)  show  the  time-frequency  plots,  generated  using  the  STFT 
and  the  wavelet  transform,  respectively,  of  the  same  cavity  at 
45°  incidence.  Many  more  modes  are  excited  by  the  obliquely 
incident  wave.  Consequently  the  time  domain  response  is 
much  more  dispersive.  By  comparing  Figs.  3(a)  and  3(b),  we 
find  that  the  wavelet  representation  again  provides  a  much 
sharper  resolution  of  the  different  scattering  mechanisms  than 
the  STFT. 
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received  signal  levels  of  antennas  (d)  and  (e)  are  the  same  as 
that  of  antenna  (a),  i.e.,  the  Ao/2  dipole  antenna.  The  loaded 
monopole  mobile  antenna  has  also  been  tested,  and  its  re¬ 
ceived  signal  level  is  about  6  dB  lower.  These  facts  justify  the 
applicability  of  antennas  (d)  and  (e),  i.e.,  the  meander  line 
PCD  As. 

4.  CONCLUSION 

A  meander  line  PCDA  of  9.2-cm  length  with  45.9%  SR  and 
5%  bandwidth  of  2:1  VSWR  has  been  designed  for  PCN 
handset  usage  operating  at  center  frequency  /o  =  880  MHz. 
Owing  to  its  planar,  lightweight,  thin,  and  small  character¬ 
istics,  the  PCDA  using  meander  line  is  well  suited  for  signal 
transmission  and  reception  of  the  PCN  handset,  in  which  the 
PCDA  may  be  built  in,  and  various  antenna  diversity  schemes 
may  be  used  for  fading  reduction.  If  the  shortened  PCDA  is 
still  too  long  to  be  accepted,  then  the  monopole  version  of 
the  same  configuration  may  be  constructed  to  get  half  a  length 
shorter,  but  at  the  price  of  lower  radiation  efficiency. 
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ABSTRACT 

A  boundary  integral  formulation  is  presented  for  computing  the 
scattering  from  infinite  periodic  gratings.  Periodic  boundary  condi¬ 
tions  and  expansion  of  the  fields  in  terms  of  Floquet  harmonics  al¬ 
low  the  boundary  integral  formulation  to  treat  just  a  single  period  of 
the  infinite  grating.  Numerical  results  for  a  blazed  grating  designed 
for  use  at  infrared  wavelengths  are  presented  and  shown  to  be  in 
good  agreement  with  a  series  of  measurements.  C  1992  John  Witty  A 
Sons,  Inc. 

I.  INTROOUCnON 

Periodic  gratings  have  long  attracted  a  great  deal  of  interest 
from  both  the  optics  and  microwave  communities.  To  this 
day.  they  remain  a  popular  choice  in  a  variety  of  applications, 
including  frequency  scanning,  angular  sensitivity,  and  polar¬ 
ization  sensitivity.  Accurate  high-frequency  solutions  to  the 
problem  of  diffraction  from  gratings  did  not  become  available 
until  the  arrival  of  digital  computers.  Meecham  was  the  hrst 
among  many  to  apply  a  variational  nethod  to  the  study  of 
perfectly  conducting  gratings  [1].  Later  on,  Neviire,  Cadilhac, 
and  Petit  devised  a  conformal-mapping  technique  to  analyze 
dielectric  gratings  [2].  Recent  studies  of  gratings  have  focused 
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on  design  optimization  [3]  and  the  characterization  of  lossy 
absorbers  under  oblique  incidence  (4]. 

In  this  article  we  present  a  numerical  approach  to  the  study 
of  two-dimensional  periodic  gratings.  We  hrst  use  the  bound¬ 
ary  integral  method  to  characterize  the  rather  complex  con¬ 
tour  of  a  sin^e  groove  (or  period)  of  the  grating.  The  un¬ 
derlying  periodicity  of  the  infinite  structure  is  then  accounted 
for  by  applying  periodic  boundary  conditions  and  expanding 
the  fieith  in  terms  of  Floquet  harmonics.  Details  of  this  ap¬ 
proach  are  presented  next,  followed  by  a  presentation  of  both 
numerical  and  experimental  results. 

I.  FOmiULATIOM 

The  geometry  of  the  grating  shown  in  Figure  I  is  a  blazed 
grating  used  for  frequency  scanning  purposes,  in  this  example , 
the  triangular  grooves  are  spaced  a  length  p  apart  in  the  x 
direction.  In  addition,  a  narrow  flat  spacing  of  width  w  is 
included.  A  plane  wave  of  either  polarization  is  incident  upon 
the  infinite  grating,  resulting  in  energy  being  diffracted  in 
many  directions.  In  genera],  the  period  of  the  grating,  p, 
determines  the  number  of  beams  or  modes  of  diffracted  en¬ 
ergy,  and  the  shape  of  the  groove  determines  the  amount  of 
energy  diffracted  into  each  mode.  Only  the  two-dimensional 
case  is  considered,  meaning  that  neither  the  structure  nor  the 
incident  field  has  z  variation.  The  fundameiital  task  here  is 
to  determine  the  scattered  field  due  to  a  plane  wave  incident 
upon  the  grating,  and  from  thb,  evaluate  the  performance  of 
the  grating. 

Because  all  the  grooves  of  the  grating  are  equally  spaced 
a  length  p  apart,  it  is  sufficient  to  consider  just  one  period  of 
the  whole  structure.  Shown  in  Figure  2  is  a  single  period  of 
the  grating.  The  approach  taken  here  is  to  subdivide  this  single 
period  of  the  grating  into  a  top  region  containing  just  free 
space  and  a  bottom  region  containing  the  contour  of  the 
groove.  Each  of  the  two  regions  will  be  handled  separately. 
For  the  sake  of  brevity,  only  the  /f-polarization  case  (magnetic 
field  parallel  to  the  z  axis)  is  described  here.  The  E-polari¬ 
zation  formulation  closely  parallels  that  of  the  H  polarization. 

In  Region  I,  the  magnetic  field  H\  along  the  aperture 
boundary  can  be  expressed  in  terms  of  the  tangential  electric 
field  (E  X  ^)  [or  equivalently,  a  surface  magnetic  current 

»  f  ■  (E  X  y)].  Due  to  the  plane-wave  incidence  and  the 
periodicity  of  the  structure,  the  expression  for  H\  can  be 
written  into  a  sum  of  Floquet  harmonics  (or  plane-wave  spec- 
oa)  as  follows: 

Ff'(x)  =  2«”‘(z)  -  2 
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periodic 
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Rgura  2  The  boundary  integral  regions  for  a  single  period  of  (he 
grating 


In  Eq.  (1),  is  the  magnetic  field  of  the  incident  plane 
wave  along  the  aperture  boundary,  is  the  r-compo- 
nent  of  the  incident  wave  vector,  and  ky^  = 
-  (Bj*  +  Imtipy.  By  applying  the  standard  method- 
of-moments  procedure,  Eq.  (1)  can  be  expanded  via  pulse 
functions  and  point  testing  into  a  matrix  equation: 

\H\]  =  I2//r]  +  [KU^xJ.  (2) 

where  is  an  aperture  admittance  matrix  containing  the 
contributions  due  to  the  summation  in  (1). 

Region  II,  unlike  Region  I,  is  completely  enclosed  by  the 
aperture  boundary  and  a  number  of  other  boundaries  shown 
in  Figure  2.  Thus,  through  the  equivalence  principle  [S],  the 
fields  at  any  point  along  the  boundary  may  be  written  in  terms 
of  an  integral  over  the  tangential  fields  (or  the  equivalent 
surface  currents)  along  the  entire  boundary: 

H'KQ)  -  \ 

J  emm 
bomdary 

X  cose  H\^Hkr)m  -  Y  H?H*OA#x]  (3) 


Jo  kyP 


\  ' 


npural  Profile  of  m  infinite  grating  with  triangular  grooves 


Here  is  the  z-directed  magnetic  field  at  a  point  Q  on 

the  boundary,  r  is  the  distance  between  the  source  on  the 
boundary  and  point  Q,  and  e  is  the  incline  angle  of  Q  with 
respect  to  the  inward-pointing  normal  fi  at  the  source  point. 
Equation  (3)  can  also  be  expanded  via  pulse  functions  and 
point  testing  into  a  matrix  equation  and  converted  to  an  ad¬ 
mittance  form; 

(//«'}  =  (T“](AfJ, 

along  the  entire  boundary  of  Region  II.  (4) 

Because  of  the  additional  boundaries  comprising  Region 
n,  (4)  contains  more  unknowns  than  (2).  Equation  (4)  can 
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be  reduced  to  include  just  those  points  along  the  aperture 
boundary  by  enforcing  the  following  boundary  condiiiorts: 


Af,  »  0. 

along  conducting 
wall  of  groove. 

(Sa) 

tf,(0,y)  *  H,{p.y)e 

along  the  open 
boundanes. 

(5b) 

MAO.y)  »  -M.ip.y)e 

along  Che  open 
boundaries. 

(5c) 

(5b)  and  (5c)  are  the  so-called  periodic  boundary  condmons. 
When  the  conditions  in  (5)  are  substituted  into  (4),  an  ad¬ 
mittance  relation  between  the  aperture  magnetic  held  and 
aperture  magnetic  current  is  rendered: 

i«i'i  “  mm-  <6) 

where  (//'')  and  jA#,}  are  the  magnetic  fteld  and  equivalent 
surface  magnetic  cunents  along  the  aperture  boundary  of 
Region  II. 

The  two  vertical  boundanes  along  which  the  penodi<. 
boundary  conditions  of  (5b)  and  (5c)  are  applied  have  been 
includei^  to  enable  the  ^-polarization  case  to  be  studied  with¬ 
out  resorting  to  higher-order  basis  functions.  Choosing  the 
aperture  plane  above  the  grating  also  results  in  improved 
numencal  stability. 


With  the  admittance  reiatton  for  each  of  the  two  rcpom 
in  hand,  the  unknown  currents  along  the  aperture  can  be 
found  The  total  tangential  hekh  along  the  aperture  boundary 
of  Region  I  are  matched  to  those  of  Region  II  Performing 
this  match  using  (2)  and  (6)  and  taking  into  aoooum  tfur  change 
of  sign  between  the  magnetic  currents  just  above  and  below 
the  aperture  boundary  results  in 

« i\y%\  *  H2wri  o> 

The  strengths  of  the  various  Roquet  hatmontcs  escited  by  the 
grating  may  then  be  extracted  from  these  currenu. 

M.  WMLXn  AND  CONCLUSIOf« 

The  formulation  presented  in  the  previcms  seaion  was  valt- 
dated  extensively  for  blazed  gratings  One  means  of  observing 
the  frequency  scanning  charactertstics  of  a  blazed  grating  ts 
through  Its  retroreflection  efficiency  curves  for  varsous  dif¬ 
fracted  orders  The  cfhctency  of  a  given  order  is  the  fraction 
of  the  total  incident  energy  contained  in  that  order  Retro- 
reflection  (or  backscattenngl  occurs  when  the  angular  devia¬ 
tion  between  the  inadeni  wave  and  the  diffracted  beam  is 
zero  Specihcaily,  the  angle  of  retrorcAcction  for  a  certain 
order  is  a  function  of  wavelength  and  is  given  by  P  « 
sin  '{(  where  n  is  the  diffracted  order.  A  is  the 

free-spact  wavelength,  and  6  is  measured  from  ihe  normal. 


H-poIaruaiion  (S-pol) 


FIgwra  3  Retroreflection  efficiency  curves  for  the  triangular  grating:  (a)  £  polarization,  and  (b)  H  polarization  (di 
p  »  25  fun) 


■) 

ds  -  54  736*. 


Exoeltcnt  agreement  is  observed  between  our  calculated  re¬ 
sults  and  published  dau  in  [7]  for  gratings  of  various  blare 
angles  for  the  n  *  -  1  order. 

Next  tiK  calculated  results  were  compared  against  a  set  of 
measure  RKnts  for  a  blazed  grating  designed  for  infrared  spec¬ 
troscopy.  The  grating  was  fabricated  by  first  etching  triangular 
grooves  into  a  silicon  substrate  and  then  depositing  a  thin  gold 
coating  onto  the  etched  surface.  The  nominal  dimensions  of 
the  grating  arc  p  *  25  pm  and  ^  =  54.736*.  Because 

of  the  fabrication  process  for  this  particular  grating,  there  is 
a  small  flat  spacing  of  about  w  =  3  pm  between  each  groove 
Shown  in  Figure  3  are  the  retroreflection  efficiency  curves  for 
both  polarizations.  Wavelength  is  restricted  to  between  9.0 
and  11.0  pm,  and  three  diffracted  orders  arc  observed  (n  ^ 
-  3,  -  4.  -  5).  This  corresponds  to  retroreflection  scan  angles 
lying  between  34*  and  78*  from  the  normal.  Agreement  be¬ 
tween  the  predicted  and  measured  results  is  quite  good  for 
both  polarizations.  In  panicular,  agreement  for  the  E-polar¬ 
ization  case  holds  well  over  a  large  range  of  scan  angles.  To 
serve  as  a  contrast,  the  results  for  an  ideal  grating  (i.e..  w  * 
0)  are  also  included  in  Figure  3 .  The  flexibility  of  the  boundary 
integral  approach  and  its  reasonable  computational  efficiency 
in  solving  periodic  structures  allow  it  to  serve  as  a  useful  tool 
in  the  evaluation  of  novel  grating  structures. 
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PHASE  GRATING:  ANALYTICAL 
FORMULAS  FOR  THE  NEAR  FIELD 
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Phase  gratings,  array  dlammators.  irradmnce  shaping,  opne^  mser- 
connects,  diffractive  optics 

ABSTRACT 

We  present  two  simple  analytical  formulas  ritai  deserthe  Ihe  trradi- 
ance  distribution,  at  ctrmn  distances,  of  any  phase  gratmg  Our 
general  results  are  illustrastd  mth  two  novel  esmnptes  the  irradumc* 
distribution  at  the  near  field  of  a  tovr-frequency  btast  grmmg,  and 
that  of  a  phase  grating  wtUi  a  tnaagular  profile  t  199}  lohm  wary  4 
Sons.  Im 

1.  MTHOOUenON 

Diffractive  optical  elements  ate  powerful  tools  for  addressing 
light  beams  into  several  prespecifted  directions  This  address¬ 
ing  capability  can  be  performed  with  substantial  reduction  in 
size,  weight,  and  cost  Furthermore,  by  employing  phase 
structures,  the  difftaaivc  elements  can  work  with  high  light 
efficiency 

Consequently,  phase  gratings  find  many  useful  applications 
in  modern  optical  instrumentation,  for  exanpic.  in  optical 
scanning,  optical  interconnects  |!j.  in  setting  array  illumina- 
tors  {2.  3],  in  optical  interferometers  [4).  and  for  optical  de¬ 
coding  (5). 

Our  aim  here  is  to  present  two  simple,  analytical  formulas 
that  describe  the  irradiance  distribution  at  certain  distances 
from  any  phase  grating.  Our  general  results  are  applied  to 
discuss  the  irradiance  distributions  created  by  the  use  of  a 
low-frequency,  blaz6  grating,  and  by  the  use  of  a  phase  grating 
with  triangub'  phase  profile. 

In  Section  2.  we  discuss  some  bask  results  of  diffraction 
theory.  In  Section  3.  we  present  our  aruilytkal  formulas.  And 
in  Section  4.  we  discuss  two  novel  examples. 

2.  CNFFRACnON  THEORY 

As  is  shown  in  Figure  1 ,  we  assume  that  the  phase  ^ring  is 
illuminated  by  a  plane  wavefront  that  propagates  along  the  z 
axis.  Thus,  the  amplitude  just  behind  the  grating  is  equal  to 
its  amplitude  transmittance,  which  is  written  as 

r(x.  y)  =  expli^x)]; 

=  S  exp(i2irmx/d).  (1) 

mn  -* 

in  Eq.  (1).  the  phase  profile  is  denoted  as  ^x),  and  d  is  the 
fundamental  period  of  the  phase  gratmg. 

Under  the  paraxial  aj^roxi matron ,  the  wavcficld  behind 
the  phase  grating  can  be  expressed  as 

» 

m(x,  y.  z)  *  2  z)exp(i2iimx/i/)  (2) 


‘Periraneni  address:  Lab.  de  Senicondiictores,  UAF,  Apdo.  Postal 
1651,  Puebla,  72000.  Pue.,  Mexico. 


MICROWAVE  AND  OPTICAL  TECHNOLOGY  LtHERS  /  Vol  5.  No  10,  September  1992  483 


IEEE  AP  S  Symposium,  pp.  305*306.  Chlcioo,  it.  ^  1992. 


MMnucAL  BonucnoN  cotmcmm  tot  tb 
cora»txrrot.SAC»xo  onttcnuc  ■Aur-njua 


IteLkc 


lUMvmtoi 

AMkktX 


ItnS-lOM 


DiAiafaa 


iM 


IKTRODUCnON 

JJianttiai  ranffri— i.pHfpwdfcyHiBMHiHl. 
nwOng  dM  tppWoMhfy  gf  ll»  UiUm  Thteiy  tt 
pgr*  MM  WwMtinirTPft— WMfciDwtf  far 


i.fafaplifgbatyio 


PW»Mm  «  OMf  phyite*!  h 

■MdMfaM  MMl  faUt  faMnetiau.  ImKtp.  Ham  12) 
dMfaMthMeot<BdMMOf»MMMfali^iBd|pM<Mid^  - 

KMMfaH  ftmi  SjHfa^pMtoSc^B^  ffMlWfa,  gfMBCSiMdMM 

ih*  inpofUM  fall  phyod  bp  iImm  wnii  fa  ndv  enoi  MOtea  (RCS)  ladHeriea 
Mdwlqiiw.  Um  If  faiiMpidinoibeHnfa>poMdfa(M(JBC)iMmtaHfaaaMMian« 
lOoMOlp  the  •aamifaMi  faMdMMc  booadMp  eMMfate  (OB!)  ippnabHifai  fa  fac 
faidy  of  eoMd  menm  (34)  hM  OM  wbfc  aBCb  Moem  ffaoD  Mifad  w  fafa,  Htft 
cntiM  eoutafa.  Tho^  the  IBC  bn  moim  faetf  Mber  latuM.  fa  fapllribflltp  n 

oHR  fiDBtil  cMdnp  «m  BHdiio  bi  vtSfaMd. 

In  fail  piper.  ■  oMfaod  1$  pMunfad  far  maetfag  fat  nonHikU  dMfaafae 
coeWctenti  for  toMid  odga  iifa  nMe^  i***?-  Wtfaoa  HMiwfa|  »  «  ISC 
ipprostsBfltiOBf  4H  djict  kMiypiKS  of  ibt  commS  Is  vMd  (9  diis 

bait  of  nwnriicil  ogeWeten  wtiacifan  Oonuwp  fajpavioai  mifa  |1  Jt)  fa  addeb  fan 
diffriciioo coellldenM wtn eianeud bp  wMim IfaMpnUiiw  rwiliilM nfakfafa 
icMtrrfag  MMii,  Ml  lohM  fae  •cai>falMR  pio&a.  ttdi  to  ucaanlhbid  bp  afelfa 

(fall  dtoctcdzKto^deawin'InwIm  ii^i  (fatoi  itgioa  mm  fai  dHSicdM  eaMr. 
Namericil  difltaEiien  ciilBctonB  mi  emacMd  fain  Mapknd  widifa  •  UTD 
ftiimfrrwb  miMwict  ndurioni  to  Ifabe  toad  coifad  MTuciMai.  AgMiiMM  bMOMn 
dito  Nincian  ud  w  nict  oMoieai  nwdiod  wfadon  far  fai  Itatoi  confad  ■wicimi 
wigyM  dyrte  mhad  to  i  fapraneb  lo  fae  drndlid  aidp  of  fan  cwnplK 
•CMMrfad  pbptoci  ocrantag  fa  cofafa  fVBcnm. 

BOWDARY  OnCtiML  MEIHOD 

Ufa  eoued  nrip  caafigantioa  to  faawB  to  Pfa.  1.  A  conifat  wifa  Miiiriil 
pmoeten  Ow)  ad  faictaicn  d  extott  M  top  If  Ml  fandMlp  fafa  pemciip  cmdMteg 
•nip.  Our  tpproicb  iwiUi  Dm  dividfag  fae  problem  gicuBp  faio  dM  mpvfae 
iC|Nai(McFig2).  Bpipp|pingd»iqBfaUmeepftacfaM.fai(ieIdtoiicliMgfaiGn 
bcmliiidtofaeiiafimdiilleldialeagfaeboMidiip.  Ibe  iMoiifaf  m  of  boondMp 
famgril  HBMioni  cm  be  dfacmdaid  fate  ■  mt  of  BMtrfa  egnuliw  rto  fan  nMfaed  of 
momena.  StaM  fae  boimdiitoi  of  lO  fame  ioeiaweamdMtoAfap.il  to  aeeMMqr  fa 
■GoooM  for  fai  PO  Aide  md  wrffai  orom  flehto  Of  faurni)  tt  tofaMfe  eeofaiiiafa  or 
dM  Mill  llildt.  ByptoetafiempleiffiaifafaiiifabiMeUMfaiocbboMfafpfa 
dfacribc  dM  PO  md  lafaee  wore  Itoldi,  fai  MOfafaitalto  boMfaMtoi  nap  be  OMHoad  ■ 
I  Afae  dtoMnw  ftom  fae  edfi  of  fae  mip  orfaem  fae  odpe-faflfaoid  (toldi  becaoK 


irifafa  I  UTD 


HUMEXICAL  DIFITUenON  CDCPIK^ 

Oaoe  dtt  beandap  (toldi  mo  fmnd  to  lotvbm  fae  m  of  oMUfa  oqnodeoe,  dw 
iwaemd  fleld  toenilp  dmirttoed.  TtodtfwicfadSeldtofamieIttidtofaeboMderp 
faiiyil  mdemd  (told  drpindtod  on  ofaefair  die  icfatomd  (told  to  icwubi  obiimMim 


30S 


N 


MiM  oonnint  Ac  •conevicai  OBte  <GO)  inektaat  Md  iBfleewd  Addi  or  Mrftec  wave 
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maopucnoN 

The  elecminigiieiic  cncrn  hiriririiiried  fkaa  aa  Mkaowa  ttrga  can 
provide  infannaiiaa  iiaerhl  for  clattifyiag  and  ideaiifyiag  ihe  tancL  Tidt  ii 
oooaiioaly  accomplithed  iaieipRtiiig  the  hariticaiwied  letpoue  m  eidter  die 
toe  or  Re  ficque^  domain.  For  laigetdiaiacieriaiei  which  am  not  famnetoiely 
apparent  in  cither  the  tune  or  the  A^uency  domain,  the  joint  dme-ftequency 
icpreacnuiioa  of  the  ladar  echo  can  aomenmet  pnr^  more  iniigbt  into  the 
acatitring  medianitini.  In  a  nocnt  proer  by  Moghaodar  and  Wahontl].  the  naming 
window  Fourier  traniform  (R\^^)  (alao  known  at  the  iliort>iiine  Fourier 
Mntform)  wu  applied  to  the  backscanered  frequem^  reapome  from  an  open- 
ended  waveguide  cavity  in  order  lo  obtain  a  joint  nme-ncqueacy  leercteataiion  of 
thearigiaalai|nal.  In  the  dtwe-fiequeocy  piane,  localiaBd  evenia  in  nme  (acaneiing 
otttiB^  localized  evena  in  fraqoency  (tcaoninccs).  aa  well  aa  diapcrave  acaaering 
mechaniams  can  be  amailiaiieoualy  diiplaycd.  Conaeqiieatly.  gm  inaightt  were 
gained  on  the  difTcrent  tcattering  mechamama  which,  when  property  interpreaed. 
can  be  ooRdaicd  with  the  acatierer  feaiurea  tor  avget  idcndficadan  ippiicaiiont. 

However,  there  ia  one  iniiindc  limitaDcn  atfocieied  with  the  RWFT.  The 
fcencred  signal  in  the  frequency  domain  is  generally  comprised  of  scanering 
mechanisms  with  widely  aifTereni  chirscteriatic  acalea.  For  example.  high-Q 
leaonance  phencmeim  are  highly  localiicd  (or  amall-acale)  eveiwa  in  frequency.  On 
the  other  hand  ooniributiona  frto  non-dispenivc  icaiicring  ceaien.  which  appear 
u  sharp  peaks  in  tiine.  extend  over  large  fieqncncy  ranges  and  are  large-scale 
events.  Since  the  RWFT  uses  a  fixed  window  width,  it  cannot  resolve  multiple 
events  with  difTeient  scales  simultaneously.  In  this  paper,  the  wavi^i  analysis  of 
backscaitering  dau  is  presented.  The  wavelet  oansfora  has  anneted  raudt 
anemion  in  the  signal  prooessiag  oommuaity  meeaify  {2-5].  In  oontrast  to  the 
RWFT.  the  wavelm  Banaform  lei^ique  has  muid-maolwaon  property  through  the 
nae  of  multi-acale  windowa.  Therefore,  muiti-icalc  eveatt  in  the  frequency  data 
can  be  mom  effectively  reaolved  in  the  dme-frequeacy  plane  by  me  wavciei 
srinaferm.  Retultafcranepen-cndedwavnaideeavityaadadimcnivepUana 
cylinder  am  pmaemed. 


WAVELET  ANALYSIS 

We  define  the  waveki  oaiufiorm  cf  a  frequency  signal  F(«)  at: 

W|(t4J)  -  J  F(«s)  t*o  H  (T(m-0))  dca  (la) 

H(«)  ii  usually  tefened  so  as  die  'tothcr  wavelet'*  Nete  that  the  wavdei 
Banaferm  it  defined  in  the  frequeney  domain  for  our  applicaiion,  conimy  so  its 


osiMl  defiBitkM  ia  the  tfane  dcmeiB  Bm«iM  (la)  am  be  iwerpeeied  at 
deciowpotiiioB ef  the  fteycy  lewoete  rtw) iK>  tfmifyaf  ihiftwl  imi  Mmi 
wavdeti  ll(T(«-a)).  By  ri^ftaulKii)  with  a  filed  Kale  iiafiMliret^ettaala 
aecliaitiiiBt  ie  the  fteqneacy  letpoeae  F(m)  cae  be  eiaaciad  lad  loealiaai. 

Aberaatively.  by  dila^  H(m)  at  a  fiaad  o.  aU  of  ri»  aaahMcale  teaM  of  B(«)  ■ 
OcaabeaB^aedaoeeidH«ia*caealeHnBMa(re  11iHisihe»«BDcd*‘MM- 
rttolution’'  property  of  the  wa¥ckt  maifora  tad  it  aa  iaycaft  ahwaUfa  eeer 
theRWFT. 

By  pnpcdy  naaipalaiiiig  (laK  Bm  wavelA  aaaAm  caa  aiiB  be  caiM  eat 
OB  the  liaK  dooMBB  li^  fit)  of  *e  ta^jaal  taqaeaey  doaaiB  mpeaK  F(i»): 

Wf<t^)-  /  f(t)  t-'»b(t/e)  eJ“*  *  (lb) 

h(OitibeFuuiief  Baai<iiaatafH(a)aadaBMtiabiftfthe*a<MiwlWHiyenaJMoa.* 
Le..li(0)-0.  Siace  (lb)  U  cwm^tly  the  Foarig  iiaatfam  of  Wt)h(tft)l.  h  la  the 
piefcHednaB>erieal»-T**"**"“‘*”^****"~“**^*”**^**'‘**“l^  **"***"**** 
nT.  WithibedcfiaWaaia(lb).«ecaialMiaicnmoorami«aooafgrBriagfiw 
wavelet  tnatfonnliaai  the  titaedoaHiayaiat  of  view.  Thebadocaaiiediiipoaw 
a*  a  fnactton  of  tiaie.  or  the  totalled  faaae  pofile.  tyaically  oaoiiMi  of  ahary 
peaks  in  the  earty  tiaie  followed  by  nail  nagiag  ia  the  laie  laae.  Very  fiae  ria» 
letoltMiaii  it  aeeded  to  lesolve  the  vaioat  acaaeriag  ceMCft  dviae  dM  aarty  dm 
and  food  frapwacw  lesoiwion  (or  coane  tane  letoiadoa)  it  aaedtd  tor  Maaid  *e 
tartei  icsonaacet  m  late  date,  ftoai  (lb),  we  caa  lae  dial  Kt^)  it  a  vaitobie  dav 
window  with  its  width  dctenaiaad  by  t.  In  the  cariy  date  (nail  r)  the  dm 
window  width  it  naDaadieovidet  PM  dole  leaohaiaa.  ladKlmdm()ar|Bt) 
the  dm  window  width  it  lafpe  and  provides  food  taq^ay  molntiea. 
Tbetefcre.  the  variable  dm  icaolndoa  piaperty  of  dw  wavelei  aannm  it  idtafiy 
suited  for  analytinf  dm  dau  which  ooataia  eariy-dm  aceMeriag  oaoitr 
hifannadan,  iaienneAa»dm  diM'*!'*  OMchanan  tBdl8iB<dmiaiaaaacat. 


wa  VFI PT  aCTMaevTATiON  OF  ■AaearATiiMwn  data 

The  wavelet  i^ieteiwadoa  ^  the  bacfacaneriag  dm  ftooa  aa  opaamM 
wavqpridc  cavity  it  contidend.  The  cavity  it  aa  open  Hided  cacalamundde 
with  a  diaaieier  of  1.73  in.  A  flat  ooadaedag  tcradaadon  eaiait  2  It  iama  lha 
waveguide.  To  generate  the  backteaneriag  dun,  dw  ladw  emt  tectiaa  of  ddt 
target  is  lint  cotaputed  in  the  frequcacy  domain  We  take  iaio  aoeom  of  Ae 
interior  cavity  coatribuiion  atiag  a  modal  approerh  m  aad  dm  dUbaedea 
oontribudon  Cram  the  ftoat  rim  of  the  cavity  aBiM  the  aayatpiedc  fiamaln  in  IT). 
The  tim-dotnaia  letponic  it  dm  nbtained  by  Foarier  tiaaafiaaadag  dke  bead* 
Unatedfiequcacydata.  Rg.  1  shows  the  dBm-fietpmacywevalairapaaaaowiaa  of 
the  baducanering  data  at  aoimal  incidence  (bdr).  Tim  emvelei  aaaafem  it 
hapletneaied  utiiig  equadoa  (lb)  with  the  aid  of  the  FFT.  The  flaKdea  h(t)  it 
chotca  to  be  a  two-tided  Kaiaer-Bcsaei  window  wMi  rnQ^tmtfOiS.  (He  0- 
factor  is  definedaaM  ratio  Imrwacn  dm  awdow  widdi  and  dm  wiadear  oaaiat.) 

mode  can  be  cleariy  idendSad. 

Next  dm  backteaneriag  fiam  a  dimcriive  plana  cyHader  it  amdiad.  tie 
cimlar  piatra  cyUader  hat  a  radiut of  11.94 can.  AoeMplaim mitl Wit 


•nanadianodeliwqrliiMfcrMattipcnivadkiMriewMiftsIl’iV^- 

fHMM  dM  backfcMcriiit  ftvqucncy  ictpoMc,  **  digeafnciicai  foMoi  m  te  « 
dfcular  dicketric  cyliiMlcr  is  impkmwmd  — warfcally.  Tlw  mMM  ttes- 
AMtMaey  floi  obttiMd  wiat  tiw  wavakt  oaasfsns  is  Aqm  is  ng.  2.  Two 
tBiwnn  rswiTMH  nwrIiMisni  rt*  hr  «- 

Mm  das  IP  As  suarlor  spsoilv  tfflecdon  iH  cyttsdtr  Md  a  disgantai  e«v«  dse 
IP  the  nwra  inwfinily  laOaciad  cowaribadM  ftoi  d»  soaray  paaawaad  awo  dk 
cyModer.  Below  dw  ptona  bcqaeacy,  dK  plana  it  conn  ip  wa««  yaoniiakai 
iM  tfM  bcdttCMBinf  oQOttf  solely  Aoitt  Ac  flRMm  CMBiiof  icflccdoB.  AlHvtAc 
pkflBa  Aoiiaci^.  dK  piaMia  cyMndor  it  yaniaBy  MMoabia  aad  dH  oaDa^BMMMy 
icflaeiBdoanaibMHBityraamL  Ukonci  iaieiMliyiartacartwa»antaiiaMi« 
die  duneatr  of  dw  cyModer  iliRM|iidttdiiyennepla«M.  SiaoedMaiMitwclochy 
of  dw  wave  ia  dK  idasiiia  is  praponioaai  IP  die  nvd  toe.  dM  carw  10  the  wMetn 
yfauM  io  fact  lepicteott  the  diversion  diairam  of  dai  ylaimt  hoaa  dM  two 
caamplet  oootidcrad  io  ihit  pifcr.  it  it  oeaeivad  dM  the  wavcki  muufeim 
yeoeidet  good  icaohidoo  io  twadmieoiitiy  ideodMag  die  acaneriM  e  loiiri  aod 
fcaolving  die  fctooaoi  yheoaneoa  of  the  latyet.  Mdk  adequMeiy  dcafTitog  to 
duyenive  tcaMriM  oMchtoitait  io  to  NMnoadiaai  lagioo. 
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Image  Interpretation  Using  Multiple  Sensing  Modalities 

Chen-Chau  C  hu  and  J.  K.  Agganval 


Absinci — This  paper  presents  the  aulomalic  interpretation  using  mul¬ 
tiple  sensors  i.AI.SISi  system,  which  is  an  automatic  image  interpretation 
.system  using  registered  laser  radar  and  thermal  images.  Its  objective 
is  to  detect  and  recognize  man-made  objects  at  kilometer  range  in 
outdoor  scenes.  The  mubinmor  fusion  approach  is  applied  to  four 
sensing  modalities  (range,  intensity,  velocity,  and  thermall  to  improve 
both  image  segmentation  and  interpretation.  Low-level  attributes  of 
image  segments  (regionsl  are  computed  by  (he  segmentation  modules  and 
then  converted  to  the  KEE  format.  The  knowledge-based  interpretation 
modules  are  constructed  using  k£E  and  Lisp.  AIM.S  applies  forward 
chaining  in  a  bottom-up  fashion  to  derive  object-level  interpretations 
from  databases  generated  by  the  low-level  prtKessing  modules.  Segments 
are  grouped  into  objects,  and  objects  are  classified  into  predefined 
categories  using  selected  features.  The  efficiency  of  the  interpretation 
process  is  enhanced  by  transferring  nonsymholk  processing  tasks  to  a 
concurrent  service  manager  iprogram).  A  parallel  implementation  of  the 
interpretation  module  on  a  multiple-input  multiple-data  (MlMDi  machine 
is  also  reported.  Experimental  results  using  real  data  are  presented. 

Index  Terms —  Image  interpretation,  knowledge-based  systems,  laser 
radar,  mullisensor  fusion,  tbermal  imaging. 

I  Introdlltion 

This  paper  reports  on  (he  automaiic  interpretation  using  multiple 
sensors  (AIMSi  system,  which  is  an  automatic  system  for  scene 
interpretation  using  registered  laser  radar  (ladar  or  lidar  jll)  and 
thermal  images.  The  goal  i  t  the  system  is  to  detcci  and  recognize 
man-made  objects  (MMO)  at  kilometer  range  in  outdoor  rural  scenes. 
AIMS  consists  of  two  building  blocks;  1)  the  segmentation  modules 
for  lovc-level  processing  and  2)  a  ruic-ba.sed  inlerpretalion  system 
for  high-level  reasoning.  The  MMO's  in  our  test  images  are  mostly 
ground  vehicles.  The  background  is  composed  of  vegetation,  ground, 
and  sky.  An  earlier  version  of  AIM.S  using  only  laser  radar  data  was 
reported  in  |2j. 
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In  this  work,  the  mullisensor  fusion  (MSF)  approach  is  used 
to  integrate  information  derived  from  multiple  modalities.  MSF 
helps  to  improve  both  image  segmentation  (by  pixel-level  sensor 
fusioni  anc  mage  interpretation  (by  object-level  sen.sor  fusion).  MSF 
pursues  a  'nmatic  information  integration  from  different  types 
of  sensors,  multiple  observations  through  different  channels,  and 
different  processing  techniques.  Therefore,  vision  .systems  based  on 
MSF  can  provide  a  better  performance  than  that  of  monosensor  vision 
systems. 

The  knowledge-based  systems  (KBS)  approach  has  been  applied 
to  various  machine  vision  tasks  such  as  image  segmentation,  object 
recognition,  and  scene  interpretation  (3]-(6|.  Some  recent  research 
on  machine  vision  is  based  on  range  imaging  for  precise  .3-D 
modeling  and  understanding  17).  (8).  However,  both  (7)  and  (8)  use 
indtNvr  range  data,  which  are  usually  more  precise  than  data  obtained 
from  ouldoor  range  imaging  because  of  the  much  shorter  distances 
involved.  Ladar  range  and  intensity  data  are  used  in  XTRS  [9|.  which 
is  not  KBS  oriented.  XTRS  and  the  earlier  implemenlation  of  AIMS 
|2|  arc  not  able  to  use  thermal  images  as  input. 

The  ladar  images  used  in  this  work  have  three  inherently  registered 
components;  range,  intensity,  and  velocity.  The  thermal  images 
are  manually  registered  with  the  ladar  images  (such  that  the  pixel 
sizes  and  the  viewing  orientations  arc  approximately  the  same). 
Thermal  images  have  been  extensively  .studied  (10),  |ll|  for  image 
segmentation  and  interpretation.  Thermal  images  usually  lack  high 
spatial  resolution  and  convey  very  little  .3-D  information  about  the 
imaged  objects.  However,  thermal  images  provide  information  on 
the  material  composition  of  objects.  Therefore,  thermal  imaging  is 
an  effective  Kwl  to  detect  MMO's  and  complements  the  capabilities 
of  ladar  imaging.  Both  ladar  range  data  and  thermal  data  are  used 
in  [12). 

AIMS  uses  all  four  available  modalities  in  an  integrated  fashion  at 
both  the  segmentation  level  and  the  interpretation  level.  Several  seg¬ 
mentation  methods  are  implemented  for  the  four  sensing  modalities 
used  in  AIMS  to  extract  different  types  of  information.  Three- 
dimensional  geometry  and  object  .surface  structure  are  extracted 
from  ladar  range  data.  Intensity  data  provide  object  surface  reflec¬ 
tivity  information,  VeltK'ity  data  indicate  moving  targets.  Thermal 
images  provide  information  about  object  temperature  and  thermal 
capacitance.  AIMS  includes  an  integration  module  [13)  to  integrate 
multiple  segmentation  maps  with  user-supplied  weights.  The  image 
segmentation  modules  and  the  low-level  integration  module  use 
minimal  knowledge  about  the  problem  domain.  Various  segmentation 
(or  edge)  maps  arc  integrated  into  a  single  map  (13)  before  the  inter¬ 
pretation  starts.  Hence.  AIMS  has  more  segmentation  information  for 
scene  interpretation  than  systems  relying  only  on  information  derived 
from  a  single  source  or  a  single  feature  extractor.  AIMS  employs 
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Fig.  1.  AIMS  System  overview 

forward  chaining  (FC)  for  a  data-driven,  hottom-up  approach.  Inexact 
reasoning  is  used  as  the  mechanism  for  high-level  integration  The 
assertions  generated  from  various  rules  (knowledge  sources)  are 
integrated  in  the  form  of  consistent  interpretation  hypolhc.ses  and 
increased  confidence  factors. 

Fig.  I  shows  the  overall  structure  of  AIMS.  The  segmentation 
module.s  are  written  in  C.  whereas  the  reasoning  modules  arc  built 
using  KEE'  and  Lisp.  KEE  is  a  commercial  package  for  expert  system 
development.  It  provides  the  inference  engine  and  the  rule  parser 
in  .AIMS.  The  interpretation  process  starts  hy  checking  attributes 
extracted  by  the  image  segmentation  modules.  It  then  labels  each 
.segment  as  part  of  a  man-made  object  or  as  the  natural  background 
(I3G)  based  on  these  parameters,  Next,  segments  are  grouped  into 
(ihjccts  based  on  several  criteria.  Image  interpretation  rules  then 
generate  hypotheses  of  object  interpretations,  The  hypotheses  are 
strengthened  or  weakened  by  examining  more  evidence. 

II.  LaDAR  iM.Afih  StC'MfcNI ATION 

Ladur  uses  modulated  laser  light  as  the  illumination  source  for 
outdoor  long-range  sensing.  Ladar  images  arc  much  noisier  than 
images  from  video,  thermal,  and  indoor  laser  images.  The  random 
refraction  and  reflection  of  laser  light  in  the  atmosphere  and  on 
the  object  surfaces  generate  s[wklc  noisv.  This  noise  is  significant 
in  long-distance  outdoor  range  imaging  but  virtually  noncxi.stcnt  in 
problems  involving  indoor  range  imaging,  such  as  in  (7]  and  [fi|. 
Although  many  edge  detection  and  segmentation  techniques  have 
already  been  established  for  video  intensity  images,  few  such  methods 
have  been  established  for  ladar  images.  In  this  work,  two  region-based 
segmentation  methods  {surfatf  jiumg  and  pixvl  valiw  •natistics)  are 
applied  to  ladar  data.  The  surface  filling  method  is  designed  to  high¬ 
light  object  surface  geometry,  whereas  the  image  statistics  method 
is  used  to  delect  differences  in  pixel  value  statistical  properties.  A 
complete  discussion  of  these  two  segmentation  algorithms  and  their 
performances  using  ladar  data  is  reported  in  [2]  and  [14]. 

Most  man-made  objects  are  made  of  surfaces  representable  by 
patches  of  low  -order  surfaces.  This  assertion  is  practically  true  when 
the  distance  to  an  object  is  large  compared  with  its  body  dimensions 
as  it  is  in  our  task  domain.  Only  planar  surfaces  arc  u.sed  in  current 
AIMS  implementation,  although  the  module  is  capable  of  fitting 
surfaces  up  10  bicubic  polynomials  (15).  However,  high-order  surface 
fitting  i.s  more  .sensitive  to  noise,  which  is  abundant  in  our  problem 
domain.  The  surface  fitting-based  segmentation  algorithm  employs 
a  region-growing  approach.  Surfaces  are  fitted  to  segments,  and 
segments  grow  as  long  as  the  fitting  error  is  within  a  specified 
error  bound.  Surface  normal,  distance,  and  different  object  surface 
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materials  may  generate  difierem  speckle  patterns,  which  in  turn 
generate  image  areas  with  different  statistics  in  pixel  values.  The 
statistical  approach  is  al.so  applicable  to  range  and  velocity  data.  Fur 
example,  the  average  range  value  from  all  pixels  within  an  image 
segment  is  a  good  estimation  of  its  distance  to  the  sensor, 

III.  THFRMAL  iMACjt  .StGMfcNIATION 

Thermal  infrared  (IR)  images  have  been  extensively  studied  (10). 
(II)  for  image  segmentation  and  interpretation  because  thermal 
images  provide  information  on  the  material  composition  of  objects 
The  physics  behind  thermal  imaging  is  discussed  in  (16).  The  pixel 
values  in  thermal  images  are  usually  dominated  by  the  temperatures 
of  imaged  objects.  The  temperatures  are  determined  by  various 
thermal  properties  of  different  materials,  such  as  emissivity,  the 
thermal  capacitance,  and  the  heat  sink/souree  distinction.  Some  of 
these  pnipertics  have  different  values  for  various  object  surface 
materials  and.  hence,  can  be  used  to  distinguish  MMO's  from  natural 
background.  Thermal  sensing  can  overcome  visual  camouflage  and 
delect  objects  under  suitable  asvumplions  of  thermal  behavior  (e  g., 
most  objects  have  similar  emissiviiics).  Usually,  thermal  images 
pr<ividc  a  reliable  differenliarion  between  metallic  objects,  concrete, 
water  bodies,  and  vegetation. 

Uuantitative  analysis  of  thermal  images  is  dilficult.  however, 
because  the  heal  transfer  process  is  complicated  and  the  atmosphere 
iniroduccs  strong  absorption  and  random  refraction  of  heat  fluxes 
Consequently,  thermal  images  lack  conira.st  and  spatial  resolutions 
to  discern  details.  Therefore,  thermal  imaging  is  an  effective  tinil 
to  detect  MMO’s.  but  it  is  less  useful  for  interpretation  tasks  and 
quantitative  analysis.  Fortunately,  conlra.sls  between  materials  with 
distinct  thermal  capacities  arc  usually  strong  enough  to  separate  metal 
and  concrete  from  vegetation,  soil,  and  wikhI.  Besides,  as  opposed  to 
thermal  imagery,  ladar  can  acquire  structure  and  shape  information 
to  complenieni  thermal  imaging. 

Thermal  data  are  used  in  a  qualitative  way  in  AIMS.  High- 
lemperaiurc  pixels  indicate  heal  sources,  such  as  engines  and  exhaust 
pipes,  or  materials  with  small  heal  capaeilanecs.  such  as  metals.  Ijow- 
lempcraturc  pixels  indicate  heat  sinks,  such  as  vegetation,  or  materials 
with  large  heat  capacitances,  such  as  water,  and  low-lcmperaturc 
areas,  .such  as  a  shadow,  Al  the  pixel  level,  thermal  images  provide 
region  segmentation  maps,  which  are  then  integrated  with  other 
information  sources.  Al  the  object  level,  thermal  data  are  used  to 
detect  metallic  objects  and  heat  sources.  U.sing  thermal  images  jointly 
with  range  data  helps  to  detect  covered  cavities,  e.g.,  the  pas.sengcr 
compartment  of  a  truck.  Glass  windows  of  vehicles  are  usually  cooler 
than  neighboring  metallic  parts.  This  conjecture  can  be  verified  with 
weak  laser  return  intensity  data  and  low-error  planar  surface  lit.  Hot 
exhaust  pipes  and  engine  blocks  can  confirm  moving  targets  with 
segmentation  derived  from  ladar  velocity  data.  Most  thermal  imaging- 
based  research  on  target  recognition  are  signature  oriented.  However. 
AIMS  is  not  limited  to  the  signature-based  recognition  strategy  since 
ladar  imaging  provides  .^-D  structural  information  and  better  spatial 
resolution. 

Thermal  images  have  different  characteristics  from  video  images. 
Hence,  different  low-level  processing  techniques  are  needed.  For 
example,  the  Burns  edge  detection  algorithm  (I7|  does  not  work 
well  in  thermal  images  because  of  overblurring.  The  quantitative 
analysis  technique  described  in  [  Ifl)  is  not  applicable  in  the  desired 
work  domain  of  AIM.S.  First,  regi.sicred  video  intensity  images  arc 
needed,  whereas  AIMS  currently  uses  only  ladar  intensity,  which  is 
significantly  different  from  ordinary  video  intensity  imaging.  Second, 
targets  in  the  test  images  are  on  the  order  of  I  km  away  from 
the  camera.  Therefore,  the  blurring  of  thermal  images  is  loo  much 
to  facilitate  the  computation  of  heat  flux  pixel-by -pixel  as  in  jlO). 
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Finall>  surtacc  normal'  are  assumed  perfcclly  knossn  in  |  Klj.  but 
!hc\  base  ui  be  estimated  irom  noisy  ladar  range  data  in  the  work 
domain  ot  AIMS  l  ig  2  shows  some  typical  thermal  images  (each 
lii.S  N  I'h)  used  in  this  research.  The  lop  row  contains  a  truck  (lell) 
and  a  truck  and  a  personnel  carrier  {right).  The  bottom  row  contains 
two  trucks  and  a  bulletin  hoard  (left)  and  one  truck  occluding  the 
personnel  carrier  (right). 

.-\  piipular  approach  lor  IK  segmentation  is  lliri-sholdiiii’. 

It  distinguishes  between  background  and  MMO's  using  the  histogram 
of  pixel  Values,  assuming  that  pixel  values  are  derived  from  a  bimodal 
distribution  tunciion  The  IR  images  used  in  this  research  satisfy  this 
assumption.  INually.  .1  l.irge  cluster  exists  for  low -temperature  pixels 
(vegetations),  and  a  second  cluster,  which  is  much  smaller,  contains 
high-temperature  pixel'  (metallic  objects).  In  the  lest  images.  MMO's 
usuallv  occupy  less  th.m  l.s'r  of  the  total  number  of  image  pixels  and 
exhibit  higher  temperatures  than  those  of  the  background  vegetation. 
All  the  images  have  open  held  or  trees  woods  as  a  background.  The 
field  is  covered  with  grass  When  the  central  portion  o*  IR  images 
is  resampled  with  the  s.ime  spatial  resolution  as  the  ladar  images, 
usuallv  'Xf  r  ot  the  pixels  belong  to  the  background  in  a  resampled 
12b  »  250  w'indovv.  For  an  objeel  5m  long  and  2  5ni  high,  the 
broadside  v  iew  takes  about  121';  of  the  window. 

One  can  assume  that  ail  the  different  thermal  eharaeieristics  of 
background  vegetation  can  be  approximated  by  a  Gaussian  distri¬ 
bution  ot  pixel  values  The  Gaussian  bell  of  background  vegetation 
is  hxjled  at  the  low  end  of  the  histogram  because  hardly  anything 
is  cooler  than  the  background  vegetation  (except  shadows  and  the 
sky)  The  peak  of  this  Gaussian  bell  is  also  the  peak  of  the  entire 
histogram  because  the  background  dominates  the  entire  image  (Fig. 
.5),  A  segmentation  scheme  is  designed  based  on  the  above-mentioned 
observations  and  assumptions.  The  standard  deviation  n  of  the 
Gaussian  distribution  consisting  of  the  low -temperature  pixels  is 
determined  by  solving 

— t -  II  y  , 

It.  I  =  cxilt - - -  I  (1) 

2rr- 

where  ’  is  the  .s-db  width  of  the  Gaussian  bell  In  a  Gaussian 
disiribuiior.  the  mcun  /'  is  the  same  as  the  rnf«/c  of  the  distribution 
and,  hence,  is  easily  determined  as  the  peak  of  the  ht.stogram.  .la.  is 
estimated  by  tracing  the  histogram  to  find  the  .sOG  drop-off  point 
from  the  peak  of  the  histogram.  Note  that  /<  is  not  determined  as 
the  average  of  the  enure  thermal  image.  All  pixels  with  gray  values 
covered  bv  the  range  of  (f),  p  -)-  o  j  are  classified  as  background:  all 
pixels  with  gray  values  in  the  range  of  (/r  -I-  To.  255 j  are  considered 
to  be  MMF).  Therclote.  the  thresholds  arc  determined  dynamically 
for  individual  images  Pixels  with  gray  values  in  between  are  then 
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ITg  4.  Thermal  image  segmcnljlion  results  Irom  images  in  Fig  2 


determined  by  their  proximity  to  classified  pixels.  However,  only- 
regions  that  are  large  enough  are  established  as  segments  to  reduce 
noise  effects. 

Fig.  .4  shows  the  histograms  of  tour  thermal  images  shown  in  Fig. 
2.  All  of  them  satisfy  the  assumption  ot  small  target  surface  areas  in 
a  large  background  full  of  vegetation  and  soil.  The  hisiogranis  clearly 
indicate  the  Gaussian  bell  distribution  at  the  lower  end  (left)  of  the 
plot.  Fig.  4  shows  the  segmentation  results.  The  execution  time  of 
the  dynamic  thresholding  algorithm  is  about  5  s  on  an  IBM  RT  and  is 
the  fastest  method  among  all  segmentation  methods  employed  in  this 
work  Note  that  the  segmentation  derived  from  th-.mal  data  usually 
cannot  detect  the  occlusion  contours  between  two  objects  of  similar 
thermal  properties  In  addition,  the  same  segmentation  algorithm  does 
not  perform  as  well  when  it  is  applied  to  urban  scenes  with  many 
concrete  structures  (high  thermal  capacitance)  in  the  scenes.  In  such 
cases,  the  histograms  no  longer  satisfy  the  a.ssumplions. 

IV  Tut.  In  n.(, RAMON  OK  SlXIMl  N  I  ATION  M  aPS 

High-quality  image  segmentation  is  a  crucial  intermediate  stage 
before  image  interpretation.  The  strong  noise  makes  the  inicrpreiaiion 
of  ladar  images  even  more  difficult.  Although  various  techniques 
have  been  proposed  for  different  sensors,  each  one  of  them  has 
respective  merits  and  weakness.  Different  methods  operating  on 
multiple  data  sources  generate  ditferent  segmentation  maps.  These 
maps  may  have  errors  and  possibly  contradict  one  another.  AIMS 
includes  an  integration  module  that  is  very  different  from  and 


Itl.I  TRASSACTIOVS  OS  PATIbKN  ANALYSIS  AN!)  M  \(  HIM  IM  l  1 1  lOfcSt  b.  VI 11.  14.  SO  ^  VK.I  SI  is>.; 


Th.n  lines 
as  input 


The  circle  determines 
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Thick  lines  as  output 

Fig.  5.  1.0'S -level  integration  of  segment jiion  euev. 


more  tIe.siMe  than  previous  works  developed  tor  similar  purposes 
(l.S)-|2H].  For  example,  it  is  helpful  to  applv  different  weights  on 
various  input  segmentation  maps  because  qualitv  and  reliability  of 
different  segmentation  methods  and  data  sources  may  be  differ¬ 
ent. 

The  low -level  module  (i.e..  using  no  high-level  knowledge)  in 
AIMS  (l.fj  integrates  both  region  and  edge  segmentation  maps, 
which  are  produced  bv  various  methods  operating  on  different  sensor 
outputs.  The  maximum  likeMuMtl  criterion  is  used  to  generate  an 
initial  solution  of  edge  pixel  positions  from  samples  collected  in  local 
windows.  With  the  assumption  ol  white  Oaussian  noise,  ihe  soIuikui 
'  is  in  the  form  ol  a  weighted  average  from  m  observed  vamples  . 


Y'"  ,'i 


(2) 


,ind  simiiarty  for  .v,.  L'ser-supplied  weights  may  be  considered  to  be 
the  etlccls  I'f  the  precision  of  various  observation  processes  and  can 
he  combined  with  ^,'s.  The  size  of  the  local  window  ihtis  decides 
the  sp.itial  scope  of  the  low-level  fusion,  and  it  olavs  a  role  similar 
to  ihv  cluiunri  ri  \<‘hiiu»i  wiilili.  Solutions  produced  by  neighboring 
windows  are  linked  lo  preserve  contour  connectivity  (fig  .s).  Tlic 
resultant  edge  map  is  represented  by  a  node-spring  model.  Ihe  edge 
map  IS  then  mi'ditied  lo  minimize  the  stored  potential  energy  within 
Ihe  nolle -spring  model.  Finally,  the  edge  map  is  converted  to  a  region 
map.  and  small  regions  arc  removed. 

The  current  implementation  of  this  integration  module  is  indepen¬ 
dent  ot  the  image  sources  and  segmentation  techniques.  Doth  region 
and  edge  maps  are  allowed  In  addition,  different  weights  can  be 
applied  to  various  information  sources  lor  difterenees  in  data  qu.itilv 
and  reliabililv.  In  general,  range  data  are  not  as  noisy  as  iheir  imensily 
counterparts  and.  therefore,  are  given  higher  weights.  Velocity  data 
provide  useful  segmentation  inlormalion  only  it  mov  ing  targets  are  in 
the  scene  Therefore,  the  weight  on  velocity  segmentation  depends  on 
the  segmentation  outcome  of  individual  images,  Edge  maps  derived 
from  ludar  data  are  usually  of  a  lower  quality  and,  thus,  receive 
less  weights.  However,  edge  detection  usually  prttvidcs  better  cues 
than  region  segmentation  for  ladar  intensity  data  and  jump  edges  in 
ladar  range  data.  Thermal  segmentation  maps  usually  produce  precise 
MMO  Bfj  boundaries;  hence,  a  heavy  weight  is  given.  However,  the 
integr.tli'in  algorithm  is  robust  enough  even  if  strong  weights  are 
misiakenlv  given  to  the  wrong  information  sources  [I3|.  If  video 
intcnsiiv  images  (e.g..  from  TV  camera)  ,ire  available,  they  can  also 
contribute  to  the  integration  process. 

A  set  ot  utility  programs  collects  the  values  lor  various  attributes, 
such  as  region  size  and  standard  deviation,  using  the  original  images 
and  the  integrated  segmentation  map.  These  data  arc  converted  10 
the  representation  formal  of  W-F,  by  the  thiltihaic  and 

(he  database  is  then  transferred  to  the  interpretation  modules  (2j, 
The  database  generator  serves  as  the  interface  Irom  the  low-level 
segmentation  block  lo  the  high-level  interpretation  block 


V,  Thk  Intlrpruatiun  Systhm 

The  interpretation  modules  in  AIMS  consist  of  four  major  compo¬ 
nents:  1 1  the  inference  mechanism  provided  by  KF.f':  2)  the  rule  bases 
and  supplemental  l.isp  code,  which  contain  the  knowledge  for  image 
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Fig  (>  Block  iliagtam  lor  ihe  inicrprclaiion  syslem 


interpretation;  Ihe  Jaiu  buses,  which  are  produced  by  the  database 
generator;  .ind  4|  the  sen  in'  mutuuier.  which  executes  numerical  .ind 
graphics  tasks  lor  AIMS 

Ihe  interpretation  sttalegv  of  our  work  lollows  the  three-step 
paradigm  |21j  of  data  absiraclion.  heuristic  classiticalion.  and  re- 
lined  classihcaiion.  First,  numerical  paramelers  are  converted  into 
qualiialive  descriptors.  Second,  these  descriptors  are  used  to  generate 
intermediate  classifications  ol  negmetils  as  MMf)  s  I'r  background. 
Third,  segments  are  grouped  into  obiects.  and  these  objects  arc  further 
classitied  into  one  ol  'he  predetined  categories  Fig  (>  shows  the 
block  di.igram  lor  Ihe  interpretation  module  and  Us  operation  A 
more  detailed  description  ol  the  complete  inierprelalion  syslem  is 
presented  in  [22* 

The  rules  in  AIMS  are  organized  into  six  groups;  1)  data  prepa¬ 
ration,  2)  the  coarse  el.issilieation  of  segments  into  MMO'BG  using 
low-lexel  aliribules.  segment  grouping  using  geometric  intorma- 
lion.  4)  the  classiticalion  ot  B(i  stgments,'ohjeets,  .3)  the  clussitica- 
tion  ol  MM{)  segments  objects,  and  b)  consistency  verilicalion  ot 
neighboring  segment  objects  These  groups  ol  rules  are  sequentially 
invoked  in  the  F(  mode  At  any  given  time,  only  one  group  ol  rules 
is  aeiive  in  Ihe  maicb-n  sube  jirc  eyele.  However,  stages  4)  and  .s) 
can  tipcralc  in  parallel  The  eontiict  resolution  strategies  in  AIMS 
are  rule  wcii;hitne  and  I  II  I)  The  partition  of  rule  bases  reduces 
the  matching  overhead  of  rule  selection  and  provides  indirect  control 
over  ihc  breadih-firsi  search  implied  in  FC  Bf  roles  w  ill  be  added  in 
the  future  to  adopt  Ihe  hyputhesizc  anJ-verify  approach  for  ftx’used 
searches.  Thus,  when  a  hypotnesis  with  a  strong  confidence  is  posted. 
Ihc  reasoning  process  can  switch  into  the  BC  mode  to  verify  that 
hypothesis 

Despite  the  tlexibility  of  KI  !  .  lliree  major  issues  are  ideniilied 
as  its  weak  spots:  I)  execution  elfieiency,  2)  low -level  data  access 
during  high-level  reason  ■,  and  .^)  inlerfacc  capability  and  feedback 
lo  low-lcvcl  processes  I  isp-baseil  development  systems,  such  as 
Khb.  arc  convenient  tools  to  execute  symbolic  reasoning  tasks  and 
handle  explicitly  encoded  kniiw ledge.  However,  these  systems  usually 
dir  St)  at  the  price  of  software  t'vcrhead.  The  slowdown  iX'curs  tor 
two  main  reasons.  First,  most  such  packages  arc  built  on  multiple 
layers  of  software  and.  therefore,  arc  very  inefficient  .Second,  image 
iiilcrpre’ialion  is  not  a  task  that  consists  solely  of  symbolic  prtx'c.ssing. 
Our  solution  is  a  program  wriltei  n  ('  (the  service  manager),  which 
runs  concurrently  with  the  interpretation  modules.  The  purpose  of 
the  service  manager  is  to  help  AIMS  run  low-lcvcl  tasks  (I/O.  video 
display,  etc.)  efficiently  on  the  designated  development  platform.  The 
interpretation  module  sends  a  message  lo  the  service  manager  for 
the  desired  service.  The  service  manager  interprets  and  executes  the 
commands  and  feeds  the  results  back. 
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I  III-  s\-uni  IS  iniplLiiKiik'il  on  .in  IHM  KI  i’(  riinmnu  AIX  ( ii;s, 
iiul  In  iiMii.iin  iIk-  ('ri'jin.il  liuhir  r.inuc.  Inlonsiu,  vcltvily,  iiiul 
fi'nisliTcil  lhi.rin;il  mi.ini.'  I  ii;s.  S  aiul  II  arc  various  scurncnlalion 
maps  lot  inid  itic  low -level  iiilepralion  moilule.  i  ip_s  ')  amt  12 
are  llie  inieeraleil  secmenlaliori  maps  (lop)  am!  llie  inlerprelalion 
results  (i'otlonil  Hie  mieuraleil  sej;menlalion  maps  show  repion 
hoiiiularies  m  while  eonlonrs  overlaid  on  the  r.inpe  tmape  Tor  the 
inlerprelatii'ii  results,  white  regions  are  delevleil  larucis.  aiul  hl.iek 
aieas  tie  •.i.einenls  thal  tio  no|  have  a  hiuh  voMliilenee  imer['ielalion 
lopoihesi-  Some  ol  the  I’laek  areas  are  aeliially  elassihed  as  GROUND 
or  SKY  However,  die  eonluleiiee  laelors  lor  such  elassitiealioiis  (all 
helow  a  Ihreshold  (114)  .imi  are  eonsiilered  loo  weak  1o  reporl  1  iphi 
.urav  marks  GROUND,  and  ilark  erav  marks  SKY 

rile  lirsi  example  (lies  '-'m  eonsists  ol  a  2  s  ion  iriiek  and  an 
armored  personnel  carrier  (Al’Cl  at  '122  m  Hie  linal  secmenlalion 
ilelineales  die  enure  Irnek  as  a  sinple  sepmeiil  where. is  the  Al’C 
IS  split  into  two  sepmenls  and  then  proiiped  as  a  sinple  ohjec! 
Hie  inlerprelalion  process  classities  holh  seenieiils  .is  MMO’s  and 
recoum/es  diem  as  .t  TRUCK  .iiul  an  APC.  respecliv elv .  Hecaiise  (he 
(ruck  IS  oeeiiidcel  In  the  imape  Ir.ime  hoiindarv.  die  truck  driver's 
cahin  couhl  he  riiisiaken  tor  a  tank  turret.  However,  the  aiiaivsis 
of  ladar  inlensitv  imapes  ,ii  die  possible  liirrel  position  reveals  that 
the  left  larpet  is  a  truck  hecause  of  die  verv  weak  reitirn  Ironi  that 
portion  The  weak  return  ompared  with  its  surroiinilinps  imiicales 
hetiToueneilv  in  the  iiialeri.il  composition  of  the  upper  ceiilr.il  portion 
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ol  die  taipet.  An  open  sp.icc  is  iilcntdied  it  the  i.iil  ol  (he  M’(  . 
wliicli  supporis  die  interpret. ilion  .is  Al’(  In  eompaiinp  die  dierm.il 
and  l.idar  i.iiipe  iiilapes  Note  thal  the  dietm.il  sepment.ilion  module 
lines  not  iiichide  die  cavilv  ,ne.i  .is  part  ol  Al’C  hec.iuse  ol  the  lower 
leinpcraliiie  ol  lli.il  are.i.  Since  the  e.iviiy  is  l.irpe  enoiiph.  die  M’C 
IS  spill  into  iwo  sepiiienis  m  the  inlepraled  sepment.ilion  ni.ip  ll 
thermal  inlornialion  is  unav .lil.ihle.  Ilien  die  reiopnilion  process  musi 
relv  solelv  on  conloui  tiiiil  siiiuiural  .iii.ilysis  I  he  hodv  dimensions 
of  the  .M’C  .ire  correcilv  esiim.iied  iisinp  die  sp.iii.il  resolution  ol 
the  radar  system  .iiul  die  estimaled  l.irpel  ilist.mce  Hie  rol.ilioiis  .ire 
estimated  as  2S.1  lor  the  ,\l’(  .iml  2li  I  lor  the  truck.  T  his  is 
consistent  with  die  documenlation.  which  si.iies  ih.il  holh  l.irpcis  are 
parallel,  with  .i  rolalion  of  ahout  tii  ironi  the  viewiiie  direction 
Most  h.ickprouml  sepnieiils  ,ire  classilied  .is  GROUND.  Some  ol  them 
.ire  .ilso  classilied  as  SKY  with  weaker  (T  s  hec.iuse  of  wc.ik  l.id.ii 
iiiteiisiiv  return  .iiid  low  lemper.iliire 

Hie  second  example  (l  ips.  Ill  12)  consists  ot  two  iriicks  .iiul  i  iie 
hullelin  hoard  heivveen  ihem  .il  m.  The  tiuck  on  the  ripht  is 
not  niovinp.  whereas  the  truck  on  die  left  is  nun  nip  toward  the 
lad.ii  receiver.  'Hiere  is  .i  sironp  indie, ilion  ol  ,i  movim;  l.irpel  tor 
die  lefl  truck  in  die  velocilv  iiiiape  The  speed  is  esliin.iled  correidv 
In  the  averape  velocilv  value.  T  he  sepmenitition  module  succcsstiillv 
marks  out  die  three  ohjecls  .is  MMO's  correspondinp  lo  the  two 
milks  .mil  dll'  ccnii.il  hullelm  ho.inl  Holh  liuvks  leceive  du  loneii 
classiticalion  as  TRUCKs.  I  lowcvei.  Ihe  correspondinp  (  I  s  ,uc  we.ik 
Hie  rc.isoii  is  thal  Ihe  current  implemeiilation  id  the  contour  .inalvsis 
lilies  h.is  onlv  a  limileil  knowledpe  h.ise  to  rccocm/e  vehicles  liom 
.1  hroadsule  view.  T  heielore.  onlv  inlerprcl.ilion  rules  th.il  .iie  not 
stronciv  dependent  on  conlouis  prov  ule  the  coireci  cl.issitii atioii  I  lu 
liitret-an.ilvsis  rule  used  in  the  (ircvious  ex.imple  suppests  ih.il  the 
upper  strue'lure  of  die  Ivvo  l.irpcis  m.iv  he  trucks  Ihcrmal  iiiles  deteci 
Ihe  exhaust  pipe  ot  the  lell  truck  and  determine  th.it  die  cetilr.il  l.irpel 
IS  unlikelv  to  he  .i  he.il  source  T  he  widlh  ot  the  houndinp  recl.iiipk 
ol  the  larpel  is  nialched  .ip.iinsi  itic  width,  .iiid  noi  the  lenplli.  ol  .i 
poienlial  l.irpel  T  he  sutl.ice  lilliiip  error  lor  i.iiipe  d.ila  mdic.iics  ili.it 
the  central  MMO  sepiiieiil  is  .ilinosl  ll.it.  Its  sh.ipe  is  teclanpul.ii.  .iiid 
Us  si/e  (2.(17  Ill  hiph  ,uul  l  .s('  in  vviile)  is  smaller  than  dial  ol  most 
vehicles,  yet  it  is  taller  Ih.in  a  JEEP  Hie  si/e.  stii, tiler  surface  lillinp 
error.  nuHleratelv  hiph  lempcraiiire.  .ind  the  contour  can  he  used  .is 
evidetiie  lo  hypolhesi/e  .ind  sup|iorl  Ihe  i  l.iirii  ih.it  the  cenir.tl  scpiiiem 
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is  a  BULLETIN  BOARD  The  oilier  l«o  MMO's  have  surlaec  fillins: 
errors  ol'  alroiil  svhieh  is  approsimately  hall  the  lengi  oi  a 

truck.  The  wiJlhs  of  bolh  are  esiiniateil  to  he  2.4  aiul  2. .Vs  ni  facUially 
one  pixel  in  ilifferencel.  which  is  approximately  Ihe  vviillh  of  a  2..‘'-lon 
truck.  The  height  ol  the  vehicles  argues  against  a  JEEP.  The  il’.f'l'i 
gradient  is  used  to  estimate  that  the  rotations  (of  the  long  axis)  of  both 
trucks  are  about  III  from  the  viewing  direction.  This  measurement 
is  consistent  with  Ihe  documentation,  which  stales  that  Ihe  trucks  are 
parallel,  and  one  ol  them  was  backing  up  toward  Ihe  ladar. 

The  ilat.i  collection  and  database  gencralion  mothiles  between  the 
segmentation  block  .md  the  interpretation  block  take  about  2  min  of 
Cl’t'  lime  I'he  interpretation  block  lakes  about  |s  min  ol  wall  ch'ck 
lime  (.hvS  s  of  CIH'  time  due  to  extensive  memory  swapping)  to 
interpret  one  set  ot  images  (he  CIT'  lime  winiUI  have  been  hmger 
iroughiv  .'II  min  UCI  .ind  l‘t  min  CIH’)  if  the  ('-based  service 
manager  is  not  emploved.  and  hence,  all  the  low-level  processing 
tasks  must  be  coded  as  KTL  and  l  isp  functions. 

VII.  I’.VKAIIII  IMI’I  I  XII  M  VIIDN  Ol  till  IM  I  KPkl  I A1  ION  S  V  S 1 1  XI 

After  the  inlerprelalion  subsystem  stabili/es,  it  is  porievi  to  an 
AFiViT  for  further  improvement  in  run  time  statistics.  Kach  of  the 
M  nodes  in  the  available  I'lxi  I  machine  has  a  DSP.f2  chip  as  the 
CI’Ij.  Mt  KH  memory  lor  ci’de  and  25b  KB  for  data.  (’onse(|uenlly. 
onK  a  subset  of  the  rule  is  ported  to  the  I‘IM  1  machine  due 
to  the  limited  memory  sp.ice.  Instead  of  using  ()l’S-5  and  the  Rele 
network  |2.5|.  a  strategy  of  one  processor  per  imtige  segment  is  chosen 
(I'ig.  1.5).  rhe  issue  here  is  that  the  IM,\I  l  niitchine  adopts  a  message 
passing-based  architecture  l  ine-grained  parallelism  as  emphasized 
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in  |2’'l  would  be  diUicull  lo  explore.  I'herelore.  it  is  ptelerable  to 
duplicate  the  software  anil  Ihe  shared  data  to  reduce  the  amount  of 
message  communications. 

Data  extracied  from  l  ig.  7  ,ire  iraiishiled  into  Ihe  l)SP.52  lormal 
and  downloaded  with  Ihe  integrated  segmenlalion  map.  The  rules  art 
iniplemenled  together  with  the  inference  engine  (It-)  as  one  program 
in  (■  lo  save  the  lime  needed  lo  constriicl  a  general-purjiose  rule 
parser.  There  are  ten  rules  implemented  as  three  sections.  The  three 
sections  are  asserted  seijuenlially  in  the  l-C  mode.  Conllict  resolution 
is  determined  by  rule  weighting  :ind  by  FIFO  when  rules  have  equal 
weights.  Rules  are  of  various  degrees  of  complexity.  In  general,  they 
are  more  numerical  intensive  than  those  used  for  symbolic  matching 
in  OPS-5,  For  example,  one  rule  scans  the  entire  segmentation  map 
to  determine  the  bounding  reclangle  and  Ihe  symmetry  of  Ihe  contour 
sha|K’  ot  regions  as  |iarl  ol  its  left-hand-side  (I  IIS)  operations 
Another  rule  generates  all  Ills  messages  in  its  I.IIS  ireporling  the 
inierpretation  at  the  current  node  lo  its  neighbors)  and  processes  .ill 
ol  them  (examining  the  inlerprelalion  at  neighluiring  segments  for 
consistency )  in  its  right  hand  side  (RUS).  The  workload  ol  priK'essing 
these  messages  is  distributed  over  many  nodes,  of  course.  The  24 
data  segments  (based  on  Fig.  7)  are  distributed  among  24  nodes.  The 
remaining  41)  nodes  are  not  used  in  the  ex|x'rimeni.  A  data  frame 
describing  one  region  in  the  test  image  is  about  .5(HI  bytes.  An  average 
rule  lakes  about  1 .4  KB  SRAM  space,  whereas  data.  Ihe  IF.,  and  other 
housekeeping  codes  take  about  21  KB  SR.AM  space. 

The  total  run  lime  averages  (in  20  runs)  7,2  s  wall  clock  lime 
(WC'T).  including  b.wi  s  hosi-CPI!  time  (SIIN-.5  2,S0)  using  ihe 
UNIX  rime  command  With  a  I’lXi  i  machine  program  downloaded 
but  without  actually  running  it.  the  timing  statistics  are  liM.'s  s  t'Pl' 
host  and  7,11  WCT  (avertiged  Iron)  20  runs).  Therefore,  the  actual 
run  lime  with  code  and  data  ready  in  individual  nodes  is  0.1 -41 2  s 
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More  precise  measurementv  ;ire  nol  done  because  no  system  toot  is 
available.  Most  of  the  host  CPL  time  is  consumed  by  data  download 
and  .system  initialization  T.ual  rule  firing  is  1.T7.  and  at  least  240 
WME  changes  because  some  rules  change  several  elements.  The 
passage  and  processing  of  lOS  messages  generate  no  measurable 
increase  in  ooth  the  host  CPI'  and  ihe  WCT  (by  disaoling  the 
single  rule  that  generates  all  the  messages).  These  statistics  result 
in  an  equivalent  rule-firing  rate  of  7(Ki-14IK)  rules  s.  This  rule-firing 
.speed  is  close  to  the  dO.s  rules  s  pro]ecied  for  NON-VON  |24|,  In 
comparison,  the  complete  rule  base  of  47  rules  accelerated  by  a  f  - 
based  server  running  on  an  IB.M  RT  takes  15  min  WCT  and  56K  s 
CPU  lime.  The  IBM  RT  is  roughly  2.5  times  as  fast  as  a  VAX-1 1  '7SI1. 
whereas  a  V'AX-1  l/'/oO  has  a  rule-firing  rate  of  2-5  rules, s. 

Gupta  makes  a  good  point  [251  that  the  highly  inefficient  im¬ 
plementation  of  OPS-5  by  Lisp  and  RBS  built  cm  OPS-5  severely 
limits  the  use  of  such  systems  and  the  research  progress  Similar 
situations  exist  in  vision  research  when  the  recognition,  interpretation 
tasks  are  implemented  using  the  RBS  approach  It  is  reported  |25) 
that  C-based  impiementalion  alone  (with  some  optimization  effort) 
provides  a  speedup  of  1(1-2"  o\er  a  Lisp-based  implementation  in 
a  uniprocessor  encironment  If  Ihe  same  factor  is  applicable  (o  the 
simulation  results  in  this  experiment,  then  the  speedup  due  to  the 
reported  parallel  implementation  .s  i  .lOS/tl.d  •  1(1/  17 i/'iO  =:  10.  Vi 
without  considering  Ihe  speed  difference  between  the  CPU's  Ihe 
language  lacUir  as  2<l  may  b^  overestimated  because  the  .loX  s  run 
lime  is  obtained  from  Ihe  mixed-language  implemenlalion.  which  uses 
C  to  a  great  exicnl  [22],  The  ratio  ol  raw  speed  between  a  [)SP52 
chip  and  an  RT  CPU  is  roughly  5  to  S  in  the  sense  ol  nonun.il  MIPS 
It  Ihe  CPU  factor  is  also  con-idered,  then  the  p.trallel  implement, ilion 
(roughly)  achieves  linear  |24|  speedup.  II  ihe  mi.ige  contains  niorc 
segments  (up  to  64.  which  is  the  number  of  physical  Pt  nodes  in  the 
I'lM  i  machine),  the  run  time  on  the  Pixi  i  m.ichine  should  increase 
very  little.  In  comparison,  a  unipuKcssor  implcmcntalu>n  wall  use 
proporlionately  more  time  (w  r :  the  number  ol  segments)  However. 
Ihe  simulation  time  is  loo  short  lor  a  precise  run  time  measurement. 

Mil  f  "Nl  1  l  SKIN 

■We  presented  a  know  ledge -based  mulliscnsor  sysicm  (AIMS)  to 
inlerprel  egislered  laser  rad.ir  (ladar)  .ind  thermal  images  The 
objective  is  lo  detect  and  recogni/e  man-made  objects  al  kilometer 
range  in  ouldoor  scenes.  -\IMS  consists  of  rule-based  re.isoning 
modules  and  Ihe  segmentation  modules.  The  miillisinsur  fu\iim 
approach  is  applied  at  both  ihc  segmentation  and  re.i'oning  levels 
The  low-level  integration  module  fuses  segment, ilKin  cues  from 
multiple  image  sources  and  processing  techniques  to  generate  an 
improved  segmentation  m.ip  The  reasoning  system  iniegraies  high- 
level  informalion  from  v.inoii'  modalities  and  knowledge  sourees  in 
the  form  of  consisicni  inierpreiation  hypotheses  and  inere.ised  confi¬ 
dence  laclors.  AIMS  uses  forward  chaining  lo  drive  the  interpretation 
process  in  a  bottom-up  fashion  The  reasoning  process  lollows  the 
order  of  data  abstraction  (feature  extraction),  heuristic  cl.issificalion 
(target  detection),  and  refinement  verification  (target  reeogniiion). 
The  tasks  at  different  level'  of  Ihc  machine  vision  paradigm  arc 
performed  in  AIMS  using  different  software  loois  .md  methodologies 
A  subset  of  the  interpretation  'uhsyslem  ol  ..\IMS  is  implemented 
on  a  multicomputer  MIMD  environment  and  achieves  .timosi  linear 
speedup. 

Two  factors,  including  ll  multisensor  fusion  and  2)  the  hvpoilic- 
ses  integration  using  incxacl  reasoning,  enable  ..\IMS  lo  tolerate 
intermediate  errors  by  vcritvmg  image  scgmentalion  and  initial 
interpretation  incremcnially  Thermal  imaging  is  very  elfeclivc  in 
delecting  metallic  objects  and  vegelafton.  and  it  complements  the 
shape/structure  information  provided  by  ladar  imaging  A  high- 


qualily  segmentation  map  is  obtained  using  the  map  inlegration 
module  in  ALMS.  Consequently,  low-level  data  can  be  correctly 
aggregated  into  object-level  attributes,  which  is  essential  for  region- 
based  image  interpretation.  The  rule-based  interpretation  system 
cooperates  with  low-level  modules  for  effic'cncy  and  incremental 
verification  A  parallel  implementation  of  a  subset  of  the  complete 
interpretation  rule  bases  provides  similar  results  much  faster  than  the 
uniprocessor  mixed-language  implementation  The  speed-up  factor  is 
about  L5II  in  wall  clock  time  and  IKflO  in  CPU  lime,  ALMS  performs 
well  on  real  images  and  recognizes  unoccludcd  man-made  objects 
al  kilometer  range.  Further  Icsiing  is  still  under  way.  The  current 
implementation  ot  AIMS  is  only  a  proloiypc.  and  it  must  acquire 
additional  knowledge  lo  recognize  more  objects  When  the  problem 
domain  changes,  different  sets  of  object  models  and  recognition  rules 
have  to  be  built,  and  different  sets  of  features  may  be  needed  lor 
recognition 
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Abstract — A  new  approach  for  the  detection  of  large  man-made  objects  in  a  non-urban  area  using  a  tingle 
monochrome  image  is  presented.  In  this  study,  the  man-made  objects  may  be  unspecified  and  the  appearance 
of  the  objects  is  unpredictable.  Prominent  features  that  distinguish  man-made  objects  from  natural  objects 
are  identified.  A  computational  framework  of  applying  perceptual  organization  and  using  the  prominent 
features  is  presented.  Techniques  are  developed  to  group  low  level  image  features  hierarchically  into  a 
region-of-interest  (ROIi  likely  to  enclose  man-made  objects  or  a  substantial  part  of  the  man-made  objects. 
These  techniques  include  feature  extraction,  primitive  structure  formation,  and  segmentation.  Some  of  these 
methods  are  novel  and  others  present  unique  properties  and  advantages  compared  to  previous  related 
works.  Experimental  results  are  presented  using  real  images  that  include  several  different  man-made  objects 
in  complex  backgrounds  or  a  natural  scene  without  man-made  objects.  It  is  shown  that  the  located  ROls 
properly  enclose  the  man-made  objects  in  the  scenes. 

Object  recognition  Perceptual  organization  Geometric  structures  Feature  extraction 
Feature  representation 


1.  INTRODCCTION 

The  computer  perception  of  man-made  objects  in 
non-urban  scenes  is  a  challenging  task  in  computer 
vision  research.  It  also  presents  additional  complex¬ 
ities  and  difliculties  beyond  the  computer  perception 
of  objects  in  a  well-controlled  laboratory  or  factory 
environment.  This  paper  presents  a  new  approach  for 
the  automatic  detection  of  large  man-made  objects  in 
outdoor  non-urban  scenes.  The  methodology  presented 
here  is  based  on  perceptual  organization.  The  approach 
hierarchically  organizes  low  level  image  features  to 
higher  level  structures  to  find  a  region  of  interest  in  a 
given  image. 

The  environment  we  consider  for  image  acquisition 
is  a  non-urban  area  in  daylight  hours.  Large  man¬ 
made  objects,  such  as  bridges  and  electric  transmis¬ 
sion  towers,  may  be  present  in  the  area  among  natural 
objects,  such  as  trees,  bushes,  and  vegetation.  The 
man-made  objects  are  unspecified  and  their  appearance 
is  unpredictable.  Thus,  we  do  not  know  what  man¬ 
made  objects  may  appear  or  whether  there  is  a 
man-made  object  in  the  scene.  Given  a  single  mono¬ 
chrome  image  of  such  a  scene,  the  goal  is  to  automa¬ 
tically  detect  large  man-made  objects  in  the  image. 
Because  of  the  complexity  and  variation  of  man-made 
objects  and  the  uncertainty  of  the  natural  environment. 
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the  intermediate  goal  is  to  find  in  the  image  a  region-of- 
interest  (ROl)  most  likely  to  enclose  man-made  objects 
or  a  substantial  part  of  the  man-made  objects. 

Most  computer  vision  research  on  object  recognition 
has  focused  on  problems  with  specified  objects  in  con¬ 
trolled  environments.  For  example,  many  of  the  current 
vision  systems  try  to  recognize  objects  in  a  given  image 
with  a  uniform  background  and  one  or  more  objects 
whose  exact  models  arc  known  to  the  system.'*’  Even 
with  such  seemingly  well-defined  problems,  many 
obstacles  exist  and  considerable  research  efforts  are 
being  made.  Locating  man-made  objects  in  a  natural 
outdoor  environment  adds  yet  another  dimension  of 
complexity.  One  cannot  arrange  the  natural  environ¬ 
ment.  Natural  objects,  such  as  trees,  vegetation,  rivers, 
rocks,  and  clouds,  co-exist  in  the  scene  with  the 
possible  man-made  objects.  As  Fischler  and  Strat**’ 
observed,  it  is  seldom  possible  to  establish  complete 
boundaries  between  objects  of  interest  in  natural 
scenes,  and  very  few  natural  objects  have  compact 
shape  descriptions.  Therefore,  the  problem  of  detecting 
man-made  objects  in  a  natural  environment  is,  in 
general,  much  more  difficult  than  the  object  recognition 
tasks  in  a  well-controlled  environment. 

Many  researchers  have  investigated  the  automatic 
interpretation  of  outdoor  natural  scenes.  Some  of 
them  have  investigated  the  detection  of  man-made 
objects**"’’  other  than  buildings  and  roads.  These 
studies,  in  general,  use  additional  information  about 
the  scene,  such  as  color  or  range,  or  have  better 
knowledge  (models)  about  the  objects.  Most  of  the 


835 


836 


H  Q  Lt  and  J,  K  Aogarwal 


Other  i^ork  focuses  on  the  interpretation  of  aerial  im¬ 
ages  or  outdoor  robot  navigation.  In  the  first  area, 
techniques  base  been  developed  for  detecting  complex 
buildings  and  roads  m  which  buildings  are  modeled  as 
combinations  of  rectangles  with  uniform  intensity 
and  roads  a,  parallel  curves.'*' For  outdoor  robot 
navigation,  most  vision-related  work  concerns  road 
following''-  and  position  estimation."’"^'  Usually, 
a  sequence  of  images  is  used  for  the  navigation 
problems.  Thus,  w  hen  interpreting  each  image,  a  good 
initial  estimation  can  be  obtained  from  the  interpreta¬ 
tion  of  the  previous  images."’”' 

Unlike  the  previous  studies  cited  above,  this  investi¬ 
gation  has  the  following  features.  (1)  Man-made  objects 
which  may  appear  in  the  scene  are  unspecified.  (2) 
These  man-made  objects  may  have  more  complicated 
structures  than  the  buildings  which  appear  in  aerial 
images  and  cannot  be  easily  modeled  using  rectangles. 
(3)  It  is  hard  to  predict  the  presence  of  man-made 
objects  in  the  scene.  (4)  One  monochrome  image  is 
used,  and  no  color  or  range  information  is  provided. 
In  addition,  the  change  in  appearance  of  the  man¬ 
made  objects  caused  by  changing  view-direction  is  a 
more  severe  problem  than  in  top-view  aerial  images. 

Generally  speaking,  humans  can  detect  man-made 
objects  easily.  Psychologists  have  found  that  perceptual 
organization  or  perceptual  grouping  plays  an  important 
role  in  human  perception.  Perceptual  organization 
refers  to  the  human  s  visual  ability  to  derive  relevant 
groupings  or  structures  from  input  images  without 
prior  knowledge  of  their  contents."”  For  example, 
people  can  easily  detect  symmetry,  collinearity,  and 
parallelism.  Thus,  if  we  can  derive  similar  groupings 
or  structures  computationally  from  an  input  image, 
that  will  be  very  useful  to  the  detection  of  man-made 
objects,  especially  since  we  do  not  have  any  prior 
knowledge  of  the  image’s  contents. 

Based  on  the  principles  of  perceptual  organization, 
this  paper  presents  a  new  approach  for  the  detection 
of  large  man-made  objects  in  a  non-urban  area.  The 
system  finds  in  the  image  an  ROI  likely  to  enclose 
man-made  objects  or  a  substantial  part  of  the  man¬ 
made  objects.  Minimal  knowledge  and  information 
about  the  objects  and  scenes  are  used  in  this  work. 
Since  it  is  desirable  to  have  a  general  approach  able  to 
handle  a  variety  of  man-made  objects,  we  try  to  find 
prominent  features  distinguishing  man-made  objects 
from  natural  objects.  We  then  present  a  computational 
framework  for  applying  perceptual  organization  and 
using  the  discriminative  features  to  find  an  ROI  in  an 
image.  Other  researchers  have  also  applied  perceptual 
organization  to  various  computer  vision  tasks.'’'*'”"”' 
Their  work  has  important  impact  on  our  research. 
However,  these  studies  concentrated  on  grouping 
features  and  recognizing  objects  using  simple  generic 
models  or  exact  models  for  specified  classes  of  objects, 
Such  approaches  do  not  apply  to  our  problem  in  which 
the  objects  of  interest  are  unspecified  and,  in  general, 
have  complicated  structures. 

The  new  framework  presented  in  this  paper  is 


implemented  in  three  moduiev  which  correspond  to 
the  three  levels  of  the  grouping  procesi  The  first 
module,  feature  extraction,  detecis  linear  edges  from 
the  input  image  and  then  extracts  image  features  from 
the  edges.  The  image  features  include  hnrdr  uruciures 
(LS)  and  coterminous  lines  Uoterminanom  iCT it  Unlike 
other  collincarization  methods,'’'"-’*  our  method 
of  extracting  linear  structures  performs  line  inidmg 
and  line  extension  simultaneously  Coterminations  are 
represented  by  a  graph  called  a  CT graph.  The  second 
module,  primitive  structure  formation,  extracts  pnmitive 
structures  (PS)  in  the  image  by  grouping  image  features 
The  primitive  structures  include  parallel  PS  and  poly¬ 
gon  PS  The  definition  and  extraction  of  the  parallel 
PS  consider  more  general  situations  than  most  of  the 
previous  relevant  work  using  parallel  lines."  An 
efficient  and  systematic  procedure  is  developed  to  find 
polygon  PS  using  the  CT  graph  This  algorithm  is 
executed  in  polynomial-time  and.  hence,  is  much  more 
cfRcient  than  the  direct  search,  which  would  take 
exponential-time.  The  primitive  structures  are  then 
represented  by  a  graph  called  a  PS  graph  that  describes 
the  spatial  relationships  among  the  PS  Such  a  graph 
facilitates  efficient  higher  level  processing  The  third 
module,  segmentation,  segments  the  given  image  to 
obtain  an  ROI  most  likely  to  enclose  man-made  objeas 
or  a  substantial  part  of  the  man-made  objects  Using 
the  PS  graph,  we  develop  a  procedure  to  group  related 
PS,  to  eliminate  isolated  PS.  and  to  segment  the  image 
into  regions  occupied  by  the  grouped  PS  and  a  back¬ 
ground.  This  graphic  approach  is  efficient  and  allows 
a  parallel  implementation.  The  largest  region  of  the 
grouped  PS  is  then  evaluated  based  on  the  area  of  the 
region  and  the  statistics  of  the  PS.  If  this  region  is  deter¬ 
mined  to  be  significant,  then  it  is  deemed  to  be  the  ROI 
of  the  given  image. 

The  approach  presented  in  this  paper  is  different 
from  the  work  presented  by  Reynolds  and  Beveridge.'”’ 
In  reference  (23),  geometric  relations  among  lines  are 
extracted  and  represented  by  a  graph.  Our  method  is 
a  hierarchical  grouping  process.  We  define  primitive 
structures  and  identify  their  relationships.  The  advan¬ 
tages  are  the  abilities  to  extract  a  variety  of  geometric 
structures,  to  establish  higher  level  relationships  among 
image  features,  and  to  use  regional  information. 

We  present  several  examples,  including  different 
kinds  of  objects  in  non-urban  scenes  and  a  natural 
scene  without  any  man-made  objects.  We  show  that 
the  ROI  enclose  the  man-made  objects  in  the  scenes. 
These  examples  illustrate  the  ability  of  this  approach 
to  locate  useful  regions  of  interest  in  complex  real 
images.  The  identification  of  ROI  is  important  and 
may  be  necessary  for  the  initial  screening  of  a  large 
environment  to  detect  man-made  objects,  since  the 
search  space  can  be  reduced  to  the  located  ROI.  When 
specific  object  classes  arc  given,  the  primitive  structures 
composing  the  ROI  may  be  used  to  match  structures 
of  the  object  models  instead  of  matching  individual 
features.  The  ROI  may  also  be  used  in  human-machine 
systems  to  find  the  focus-of-attention  so  that  human 
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expertise  can  make  a  finai  determinaiion.  This  is  ap¬ 
plicable  to  real-time  operations,  such  as  assisting  an 
aircraft  pilot  by  looking  in  altcrnaie  directions  and 
providing  ROI,  and  to  off-line  processing  involving  a 
large  number  of  images. 

The  rest  of  the  paper  is  organized  as  follows.  Section 
2  briefly  reviews  previous  research  relevant  to  this 
work.  Section  3  overviews  our  approach.  Section  4 
describes  feature  extraction.  Section  5  defines  primitive 
structures  and  presents  techniques  to  derive  PS.  Section 
6  concentrates  on  the  region  of  interest.  Section  7  gives 
implementation  examples,  and,  finally.  Section  8  con¬ 
cludes  the  paper. 

2.  RELATED  WORK 

This  section  briefly  reviews  previous  work  relevant 
to  our  research.  The  review  mainly  includes  the  work 
of  applying  perceptual  organization  to  computer  vision 
tasks  and  the  detection  of  man-made  objects  in  outdoor 
natural  scenes. 

Perceptual  organization  has  been  studied  since  the 
early  part  of  this  century.  Gestalt  psychologists  studied 
a  large  number  of  grouping  phenomena  of  human 
vision  and  roughly  categorized  them  into  several 
Gestalt  Laws  (or  grouping  rules}  which  are  summarized 
as  follows:"'  *''  (1)  proximity:  closer  elements  tend  to 
be  grouped  together:  (2)  similarity:  similar  elements 
tend  to  be  grouped  together:  (3)  continuation:  elements 
that  lie  along  a  common  line  or  smooth  curve  are 
grouped  together;  (4)  closure:  curves  tenu  to  be  com¬ 
pleted  to  enclose  a  region;  (5)  symmetry;  elements 
symmetric  about  some  axis  are  grouped  together. 
Gestalt  psychologists  believed  these  factors  to  be 
important  in  the  perception  of  structures. 

In  recent  years,  perceptual  organization  has  been 
introduced  into  computational  vision  research  and  its 
functional  role  has  been  addressed."’^*'  Lowe"’* 
argues  that  the  most  important  functions  of  percep¬ 
tual  organization  include  segmentation,  three-space 
inference,  and  the  indexing  of  world  knowledge.  All  of 
these  lead  to  the  reduction  of  search  space  for  object 
recognition.  McCafferty"*’  formulates  perceptual  or¬ 
ganization  as  an  energy  minimization  problem.  He 
quantifies  the  Gestalt  Laws  by  defining  individual 
energy  terms.  However,  the  selection  of  the  weightings 
for  these  energy  terms  presents  problems. 

Recently,  perceptual  organization  has  been  applied 
to  solve  practical  computer  vision  problems."  ®  ' 

Mohan  and  Nevatia"'”  apply  perceptual  organization 
to  segment  images  into  visible  object  surfaces.  They 
also  investigate  the  detection  and  description  of  com¬ 
plex  buildings  in  aerial  images.'*’  Assuming  that  roofs 
are  the  essential  building  structure  seen  in  the  image, 
they  model  the  roof  as  a  combination  of  rectangles. 
Baker  el  alP  *'  present  an  approach  for  the  detection 
of  concrete  bridges.  The  straight  line  segments,  once 
detected,  are  grouped  into  parallel  lines.  Intrinsic 
rectangles  are  extracted  from  the  parallel  lines.  Color 
cues  are  then  used  to  restrict  the  candidate  artifacts 


and  to  produce  confidence  measures  However,  the 
rectangle-type  model  may  be  unsuitable  to  man-made 
objects  with  more  complex  structures  in  outdoor  scenes 
In  this  investigation,  we  deal  with  objects  with  com¬ 
plex  structures  as  well  as  those  with  simpler  structures 

Reynolds  and  Beveridge"”  examine  ihe  problem  of 
searching  for  geometric  structures  in  natural  scene 
images.  Straight  lines  are  grouped  using  the  geometric 
relations  of  collinearity,  parallelism,  orthogonality, 
and  spatial  proximity.  The  connected  components  of 
a  graph  representing  lines  and  their  relations  arc  il¬ 
lustrated  to  correspond  to  significant  geometric  struc¬ 
tures  in  the  image.  However,  some  compionents  may 
contain  many  different  image  events  In  addition, 
this  method  may  not  find  some  simple  geometric 
structures,  such  as  parallelograms  other  than  rectang^. 
Unlike  reference  (23),  our  method  groups  low  level 
features  into  primitive  structures.  The  spatial  relation¬ 
ships  among  the  primitive  structures  are  used  to  find 
the  ROI  most  likely  to  enclose  man-made  objects  or  a 
substantial  part  of  the  man-made  objects  The  advan¬ 
tages  of  finding  primitive  structures  and  identifying 
their  relationships  arc  the  abilities  to  extract  a  variety 
of  geometric  structures,  to  establish  higher  level  rela¬ 
tionships  among  image  features,  and  to  use  regional 
information. 

Jacobs"*’  presents  a  system  called  GROPER,  w  hich 
recognizes  two-dimensional  (2D)  objects  using  a  library 
of  many  different  objects.  GROPER  applies  perceptual 
organization  to  reduce  the  search  space  for  matching 
scene  objects  with  models.  GROPER  is  designed  for  a 
simplified  world  containing  only  2D  polygonal  objects, 
whereas  the  objects  encountered  in  our  research  are 
3D  objects  with  complex  structures. 

Various  techniques  are  presented  in  the  literature 
for  applying  perceptual  organization  to  group  lower 
level  image  features,  such  as  edge  points,  into  higher 
level  structures,  such  as  straight  lines  and  curves, 
and  to  detect  junction,  collinearity,  parallelism,  and 
symmetry.'*"'"’"*"”  Although  similar  to  some  of 
these  techniques,  the  methods  developed  in  our  research 
for  the  first  two  levels  of  grouping  have  unique  pro¬ 
perties  and  advantages.  We  discuss  the  differences  and 
advantages  after  describing  each  of  the  methods 

There  are  other  works  that  concern  the  interpretation 
of  natural  scenes  in  addition  to  those  using  perceptual 
organization.  Brooks'”  presents  the  identification  of 
aircraft  in  aerial  images  using  ACRONYM,  a  model- 
based  vision  system.  In  ACRONYM,  generic  object 
classes  and  specific  objects  are  represented  by  volu¬ 
metric  models  using  generalized  cones  along  with  sets 
of  constraints  relating  to  model  parameters.  Fua  and 
Hanson"®''’"*'  present  a  sequence  of  papers  on 
extracting  features  and  locating  general  cultural  objects 
such  as  buildings  in  aerial  images.  A  new  approach, 
based  on  information  theory,  to  evaluate  the  corres¬ 
pondence  between  generic  models  and  shape  hypotheses 
in  an  image  is  presented  in  references  (10, 28)  Generic 
models  for  buildings  are  formulated  and  experimental 
results  on  several  aerial  images  are  presented.  Beveridge 
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etaiP*'  present  a  method  for  identifying  known  21) 
models  in  imperfect  line  data.  The  method  is  applied 
to  complex  outdoor  scenes  and  good  matches  are 
demonstrated.  However,  the  above  approaches  are 
unsuitable  to  detection  problems  where  the  potential 
objects  are  unspecified,  as  in  the  case  of  our  research. 
Chu  et  of.'*  ’''  present  a  system  called  AIMS  to  detect 
and  recognize  man-made  objects  in  outdoor  scenes. 
Multiple  sensing  modalities  (range,  intensity,  velocity, 
and  thermal)  are  integrated  in  AIMS  to  improve  both 
low-level  image  segmentation  and  high-level  image 
interpretation. 

In  summary,  previous  work  has  concentrated  on 
extracting  groups  of  features;  using  simple  generic 
models  for  speciHed  classes  of  objects;  recognizing 
objects  with  exact  models;  or  using  additional  sensing 
information.  Our  task  is  to  detect  objects  with  minimal 
knowledge  and  information  about  the  objects  and 
scenes.  The  objects  are  unspecified  and  may  have  much 
more  complicated  structures  than  the  objects  con¬ 
sidered  in  the  previous  research. 

3.  overview  of  the  approach 

This  section  describes  the  basic  concepts  and  presents 
an  overview  of  our  approach.  The  approach  essentially 


oiftanizes  those  features  indicating  man-made  objects 
itiiii  structures,  and  finds  the  image  region  in  which 
related  structures  reside. 

The  goal  of  our  research  is  to  detect  man-made 
objects  from  images  of  natural  scenes.  Figure  1  shows 
an  example  of  such  an  image.  Since  the  objects  are  not 
particularly  specified,  features  must  be  found  that 
distinguish  man-made  objects  from  natural  objects  in 
an  image.  Two  of  the  most  prominent  characteristics 
of  man-made  objects  are  the  apparent  regularity  and 
relationship  of  their  components.  Most  man-made 
objects  have  linear  structures  or  linear  boundaries. 
The  linear  structures  form  certain  regular  patterns 
such  as  rectangles,  parallels,  and  polygons.  These 
regular  patterns  are  usually  related  to  each  other  and 
form  the  man-made  objects.  After  line  detection,  much 
of  such  regularity  and  relationship  remain  in  the 
resulting  image.  In  comparison,  most  natural  objects 
do  not  have  linear  structures,  and  lines  extractable 
from  their  images  are  usually  randomly  distributed. 

To  detect  man-made  objects,  geometric  structures 
with  regularity  and  relationships  must  be  extracted 
from  the  image.  Hence,  the  framework  of  our  approach 
includes  three  phases;  (1)  extracting  image  features, 
(2)  finding  regularities  and  relationships  among  these 
features,  and  (3)  identifying  the  region  occupied  by  the 


Fig.  I.  An  image  of  an  electric  transmission  tower. 
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related  regular  structures.  This  indicates  a  three  level 
hierarchical  grouping  process.  An  important  issue  in 
the  grouping  is  non-accidentalness.  Lowe*'"'  argues 
that  perceptual  groupings  are  useful  when  they  are 
unlikely  to  have  arisen  by  an  accidental  viewpoint  or 
position  and,  therefore,  are  likely  to  reflect  meaningful 
structures.  Our  computational  approach  to  perceptual 
grouping  should  agree  with  this  non-accidentalness 
principle. 

The  first  level  of  grouping  extracts  image  features. 
We  currently  consider  two  kinds  of  features:  linear 
structures  (LS)  and  coterminations  (CTJ.  A  linear 
structure  in  an  image  is  a  representation  of  a  set  of 
approximately  coUinear  line  segments  which  are  close 
and  likely  to  come  from  the  same  linear  structure  in 
the  scene.  Extracting  LS  reflects  the  proximity,  col- 
linearity,  and  continuation  of  the  perceptual  grouping 

A  cotermination  is  a  set  of  lines  terminating  at  a 
common  point  or  a  small  common  region.  In  practice, 
coterminous  lines  in  a  scene  may  not  terminate  at  a 
common  point  in  the  image  because  of  the  imaging, 
digitizing,  and  line  extraction  processes.  Instead,  they 
may  terminate  in  a  small  common  region.  The  coter¬ 
mination  is  an  important  relation.  According  to  the 
proximity  rule  of  perceptual  organization,  the  human 
visual  system  easily  groups  coterminous  lines.  In  fact, 
it  is  suggested  that  the  major  function  of  eye  movements 
is  to  determine  coterminous  edges.*’'”  Cotermination 
is  a  non-accidental  relationship  and,  hence,  reflects 
significant  structural  information.  It  is  also  view- 
invariant  in  a  wide  range  of  viewpoints  and  can  be  used 
for  3D  inference.*'  ’’  The  CT  arc  represented  by  a  graph 
called  a  CT  graph. 

The  second  level  grouping  organizes  features  that 
exhibit  regularity  and  relationship  into  larger  struc¬ 
tures  called  primitive  structures  (PS).  Each  PS  may  be 
evidence  indicating  man-made  objects.  In  addition,  a 
PS  implies  more  constraints  among  line  segments. 
Two  kinds  of  PS  are  currently  considered:  parallel  PS 
and  polygon  PS.  A  parallel  PS  is  a  set  of  parallel  lines 
satisfying  certain  conditions.  A  large  number  of  man¬ 
made  objects  contain  parallel  structures  and  human 
vision  can  rapidly  identify  parallel  lines.*'”  A  parallel 
relation  is  a  non-accidental  relationship  and  can  be 
used  to  infer  relations  in  three-space.*'”  A  polygon  PS 
is  a  closed  figure  that  consists  of  line  segments  and 
satisfies  certain  criteria.  It  is  also  a  significant  image 
relation.  According  to  the  closure  rule  of  perceptual 
organization,  human  vision  tends  to  complete  curves 
so  that  they  form  enclosed  regions.' '  ^ '  Extracting  closed 
figures  corresponds  to  this  feature  of  human  vision. 
Polygon  PS  are  non-accidental  image  events  since  the 
coterminations  forming  them  are  non-accidental. 
Hence,  they  represent  significant  structures  in  an 
image.  1  hey  are  also  higher  level  structures  than  lines 
and  coterminations.  Polygon  PS  can  also  be  used  for 
3D  inference.  For  example,  a  closed  figure  in  an  image 
suggests  a  closed  structure  in  3D  space. 

The  PS  are  represented  by  a  graph  called  a  PS  graph 
that  describes  the  spatial  relationships  among  the  PS, 


Such  a  graph  facilitates  efficient  higher  level  processing. 
Using  the  PS  graph,  the  third  phase  of  the  framework 
groups  spatially  closed  PS,  eliminates  the  isolated 
ones,  and  segments  the  image  into  regions  occupied  by 
the  grouped  PS  and  a  background.  The  largest  region 
of  the  grouped  PS  is  then  evaluated  based  on  the  area 
of  the  region  and  the  statistics  of  the  PS.  If  this  region 
is  determined  to  be  significant,  it  is  most  likely  to 
enclose  man-made  objects  or  a  substantial  part  of  the 
man-made  objects.  Then  it  is  called  the  ROI.  The 
rationale  for  this  level  of  grouping  is  the  following. 
(1)  Spatially  closed  PS  are  likely  to  be  related  and 
to  reflect  meaningful  structures.  For  example,  an  elec¬ 
tric  transmission  tower  is  a  connected  entity  and, 
hence,  the  PS  resulting  from  the  image  of  the  tower  arc 
spatially  closed.  On  the  other  hand,  spatially  closed  PS 
are  more  likely  to  be  perceptually  grouped  according 
to  the  proximity  grouping  rule.  (2)  Some  PS  may  be 
caused  by  accidental  image  relations  of  natural  objects. 
For  example,  line  segments  extracted  from  a  cluster  of 
tree  leaves  may  accidentally  form  a  parallel  PS.  Such 
PS  tend  to  be  randomly  and  sparsely  distributed  and 
are  unlikely  to  form  meaningful  structures.  (3)  Man¬ 
made  objects  usually  consist  of  regular  structures 
related  structurally  and  spatially. 

In  summary.  Fig.  2  illustrates  the  overall  data  re¬ 
presentation,  relationship,  and  flow  in  this  frame¬ 
work.  Down-arrows  in  the  figure  represent  “consist 
of”,  up-arrows  denote  “derive”,  and  lines  express  “re¬ 
present”.  “Par-PS”  and  “Pol- PS”  represent  parallel 
PS  and  polygon  PS,  respectively.  Figure  2  shows  that 
CT  and  parallel  PS  consists  of  lines;  CT  are  represented 
by  the  CT  graph  from  which  polygon  PS  are  derived; 
and  all  PS  are  represented  by  the  PS  graph  from  which 
the  ROI  is  derived.  The  data  abstractions  of  the  three 
level  grouping  are  also  indicated  in  Fig.  2. 

The  term  primitive  structure  is  also  used  in  refer¬ 
ence  (26)  to  represent  a  larger  class  of  entities,  including 
edges,  regions,  parallelism,  symmetry,  repetition,  and 
so  forth.  However,  in  this  paper  primitive  structure 
represent  the  regular  patterns  formed  by  straight  line 


Fig.  2.  The  data  representation  and  data  relationship  in  the 
framework. 
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segments.  Basically,  we  consider  the  finer  classification 
of  the  structural  entities,  since  these  entities  are  not 
grouped  at  the  same  level.  For  example,  a  line  segment 
is  a  grouping  of  edge  points,  whereas  a  parallel  is  a 
grouping  of  line  segments. 

The  implementation  of  the  framework  presented  in 
this  paper  consists  of  three  modules:  feature  extrac¬ 
tion,  PS  formation,  and  segmentation.  Each  module 
implements  one  phase  of  the  framework.  The  next 
three  sections  describe  the  techniques  developed  for 
the  implementation. 

4.  feature  EXTRACTION 

This  module  first  extracts  linear  edges  (or  line 
segments)  from  the  input  image.  These  edges  are  the 
basic  information  used  for  the  grouping  process.  Linear 
structures  and  coterminations  are  then  extracted  from 
the  linear  edges. 

4.1.  Unear  structures 

We  used  an  existing  method'^''  to  detect  straight 
line  segments  from  the  intensity  image.  For  practical 
reasons,  the  extracted  line  segments  may  not  reflect 
well  the  linear  structures  in  the  image.  A  post-proces¬ 
sing  is  required  to  obtain  the  linear  structures. 

For  this  purpose,  a  representative  line  to  a  set  of 
closely  bunched  and  similarly  oriented  linear  edges 
should  be  found,  since  these  edges  represent  the  linear 
structure  of  an  object  at  a  higher  granularity  level  than 
the  edges  themselves.'®’  For  example,  we  want  to 
extract  the  linear  structure  implied  by  a  set  of  line 
segments  shown  in  Fig.  3. 

However,  most  of  the  collinearization  techniques, 
such  as  references  (9, 2 1, 24),  are  unsuitable  for  extract¬ 
ing  linear  structures,  since  they  only  link  approximate 
collinear  lines  by  examining  the  neighborhoods  of  the 
end  points  of  each  line;  that  is,  they  perform  a  line 
extension.  Our  objective  of  extracting  linear  structures 
is  quite  similar  to  that  presented  by  Mohan  and 
Nevatia.'*'  In  reference  (8),  the  space  around  each  line 
segment  is  folded  onto  the  segment  repeatedly  to 
obtain  a  single  line  representing  the  grouped  line 
segments.  However,  this  folding  technique  does  not 
perform  a  line  extension.  The  techniques  developed  in 


Fig.  3.  Extracting  a  linear  structure  from  a  set  of  line  seg¬ 
ments. 


this  paper  perform  both  folding  and  line  extension 
simultaneously. 

According  to  Gestalt  Laws  of  proximity,  similarity, 
and  continuation,  a  set  of  proximate  lines  with  similar 
orientations  tends  to  be  perceived  as  a  continuous  line. 
The  lines  arc  likely  to  come  from  the  same  linear 
structure  in  the  scene  and,  hence,  should  be  merged 
into  one  line.  Two  techniques,  the  neighborhood  method 
and  the  classification  method,  are  developed  to  extract 
linear  structures.  The  neighborhood  method  iteratively 
groups  and  merges  similarly  oriented  lines  in  a 
neighborhood  of  each  line  segment.  The  classification 
method  groups  line  segments  by  classifying  them 
according  to  orientation,  collinearity,  and  proximity.'^*’ 
In  our  current  system,  the  two  LS  extraction  modules 
are  concatenated.  The  result  of  the  LS  extraction  is  a 
set  of  lines  including  the  representative  lines  of  the 
grouped  line  segments  and  the  un-grouped  line  seg¬ 
ments. 

More  details  of  the  neighborhood  method  are 
presented  below.  We  first  introduce  three  definitions. 
Two  lines  have  similar  orientations  if  the  angle  bet¬ 
ween  the  two  lines  is  less  than  a  threshold,  called  the 
similarity-angle.  The  neighborhood  of  a  line  segment  L 
is  a  symmetric  elongated  region  with  L  as  the  medial 
axis  of  the  region.'^^'  Two  line  segments  are  close  if  at 
least  one  end  point  of  one  line  segment  is  in  the 
neighborhood  of  the  other  line  segment. 

The  idea  of  the  grouping  process  is  as  follows.  The 
neighborhood  of  each  line  is  searched  to  find  all  lines 
with  orientations  similar  to  the  current  line,  called 
the  base  line.  The  resulting  set  of  lines,  including  the 
base  line,  is  then  replaced  by  a  representative  line.  The 
process  continues  until  no  replacement  occurs. 

To  reduce  the  search  space,  a  line  segment  is 
represented  by  its  two  end  points  and  is  indexed  by  the 
image  pixels  corresponding  to  the  end  points.  When 
searching  for  lines  close  to  a  base  line,  the  neighborhood 
of  the  base  line  is  searched.  Hence,  only  those  lines 
whose  end  points  fall  into  this  neighborhood  are 
examined.  After  a  set  of  lines  5  is  found  with  respect 
to  a  base  line  L,  with  LeS,  a  representative  line  L,  of 
5  is  computed.  L,  passes  through  the  point  that  is  the 
geometric  center  of  the  line  segments  in  S.  The  orienta¬ 
tion  of  L,  is  the  length-weighted  average  of  the  orienta¬ 
tions  of  the  lines  in  S.  To  determine  the  end  points  of 
all  the  end  points  of  the  line  segments  in  S  are 
orthogonally  projected  onto  L,.  The  two  furthest  apart 
projection  points  are  the  end  points  of  L,.  L,  replaces 
the  lines  in  S.  The  process  continues  until  no  merge 
occurs.  It  always  terminates  after  a  finite  number  of 
iterations,  since  there  is  a  finite  number  of  lines  and 
since  this  number  declines  in  each  iteration. 

4.2.  Coterminations  and  their  representation 

A  cotermination  (CT)  is  a  set  of  lines  terminating  at 
a  common  point.  For  practical  reasons,  we  only 
require  lines  in  a  CT  to  terminate  in  a  small  common 
region.  To  extract  CT  from  a  set  of  lines,  we  use  a 
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two-step  procedure.  First,  elementary  CT  are  extracted. 
Then,  closed  elementary  CT  are  grouped  if  they  satisfy 
certain  conditions.  This  procedure  is  similar  to  the  one 
in  reference  (21)  except  that  we  allow  more  than  two 
lines  in  an  elementary  CT.  In  addition,  we  derive  a 
representation  for  CT  and  their  relationships. 

An  elementary  CT  is  a  set  C  of  lines  terminating  in 
a  small  common  region:  each  pair  of  lines  { L„L^}eC 
satisRes  the  following  conditions  (sec  Fig.  4); 

(1)  S^6^n-S; 

(2)  max{d„d;}  s 

where  0  is  the  angle  between  L,  and  Lj,S  the  similarity- 
angle,  and  w  X  w  the  size  of  a  neighborhood.  Let  e^  and 
Cj  be  the  end  points  of  L,  and  Lj  that  fall  into  the  small 
common  region;  and  let  s  be  the  intersection  of  the  two 
lines.  Then  di  and  dj  are  the  distances  from  Cj  and  to 
s,  respectively.  The  above  conditions  are  the  same  as 
the  first  three  conditions  in  reference  (21).  Condition  (1) 
excludes  lines  that  are  approximately  cotlinear  or  that 
form  a  sharp  angle.  These  situations  are  considered 
in  the  linear  structure  extraction,  and  such  lines  are 
unlikely  to  correspond  to  a  CT  in  a  scene.  When  an 
elementary  CT  is  extracted,  we  say  that  line  Lj(Lj) 
joins  the  CT  at  e^lej].  Similar  to  extracting  LS,  lines 
arc  represented  and  indexed  by  their  end  points  to 
reduce  the  search  space.  For  each  line,  only  the 
neighborhoods  of  its  end  points  are  searched  instead 
of  searching  ail  the  other  lines. 

The  extracted  elementary  CT  are  further  grouped 
into  CT.  An  elcmentars  CT  C  initiates  a  CT.  Other 
elementary  CT  are  examined  if  they  have  a  common 
line  with  C  and  if  the  line  joins  them  and  C  at  the  same 
end  point.  For  example,  Fig.  5(a)  has  three  elementary 
CT,  Cj  contains  lines  L,  and  Lj,  Cj  contains  lines  Lj 
and  L3,and  Cj  contains  lines  Lj  and  L^.  If  Cj  initiates 
a  CT,  Cj  is  examined  but  Cj  is  not  because  although 


Fig.  4.  A  cotermination  is  a  set  .of  line  segments  terminating 
in  a  small  common  region. 


(a) 

Fig.  5.  Elementary  CT  are  grouped  into  CT. 


C,  and  C,  have  a  common  line  Lj,  L,  joins  them  at 
two  different  end  points.  Let  an  elementary  CT  Cj  be 
examined  for  a  possible  grouping  with  C.  Cj  is  grouped 
with  C  if  all  the  end  points  at  which  lines  join  C  and 
those  at  which  lines  join  C,  fall  into  a  common  region 
of  size  w  X  w.  All  the  elementary  CT  are  thus  processed, 
resulting  in  a  set  of  CT.  Some  of  the  CT  are  just 
elementary  CT  while  others  are  unions  of  elementary 
CT. 

The  CT  and  their  relationships  arc  represented  by 
a  graph  called  a  CT  graph.  A  representation  is  necessary 
to  derive  efficient  and  systematic  procedures  for  higher 
level  processing  that  locates  and  uses  CT.  In  the  CT 
graph,  each  vertex  represents  a  CT  in  the  image.  Two 
vertices  are  connected  by  an  edge  in  the  greph  if  the 
two  corresponding  CT  share  a  common  line  seg.ment 
at  the  two  different  end  points  of  the  line.  Figure  6 
shows  a  set  of  line  segments  with  six  CT  and  the 
corresponding  CT  graph.  The  CT  graph  enables  os  to 
extract  higher  level  structures. 

5.  PRIMmVESTRLCTURE  FORMATION 

This  section  defines  primitive  structures  (PS)  and 
describes  techniques  of  grouping  features  into  PS.  Two 
kinds  of  PS  are  discussed;  parallel  PS  and  polygon  PS. 

5.1.  Parallel  primitive  structure 

The  distinction  between  the  parallel  PS  defined  in 
this  paper  and  parallel  lines  as  defined  by  previous 
researchers**’*  ***  is  the  requirement  for  the  overlap 
between  parallel  lines.  Usually,  only  a  certain  overlap¬ 
ping  using  an  orthogonal  projection  is  required.'*  *’*  *’ 
For  example,  in  Fig.  7(a),  the  overlapping  between  the 
line  segments  Li  and  Lj  using  an  orthogonal  projection 


Fig.  6.  A  set  of  line  segments,  CT,  and  their  CT  graph. 
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local  orientacioa 


intrinsic  orienution 


(b) 


Fig.  7.  Lines  with  overlapping  in  different  directions. 


is  the  line  segment  AB.  However,  according  to  such  a 
definition,  when  the  intrinsic  orientation  of  a  set  of 
similarly  oriented  lines'**’  is  different  from  the  local 
orientation  of  each  line,  these  lines  may  not  be  grouped. 
As  a  result,  many  apparent  parallel  lines,  such  as  those 
shown  in  Figs  7(b)  and  (c),  will  not  be  identified  when 
there  is  an  angular  difference,  d,  between  the  intrinsic 
and  local  orientations  and  when  the  overlapping 
between  lines  is  small  using  the  orthogonal  projection. 
Such  situations  arise  very  often  in  practice.  For  example, 
a  set  of  parallel  lines  in  3D  space  may  fall  into  this 
situation  under  the  2D  projection  of  the  imaging 
process.  To  solve  this  problem,  we  define  two  addi> 
tional  overlapping  conditions  in  two  perpendicular 
directions.  The  idea  here  is  similar  to  the  0-aggregation 
of  Marr’s  earlier  work.'**' 

The  parallel  PS  is  a  set  of  lines,  S  =  { L , ,  Lj, . . . ,  L* } , 
m  ^  2,  that  have  similar  orientations.  In  addition,  for 
each  line  L  (SS,  there  exists  a  line  LjsS  such  that; 

(1)  Lj  and  Lj  have  similar  lengths. 

(2)  L,  and  Lj  have  a  sufficient  overlap  in  one  of  the 
three  projections,  i.e. 

g(^,(L,),^,(Lj))^  ^ 

where  )  is  the  projection  of  a  line  onto  the  x-axis, 
£’(■)  the  length  of  a  line,  C‘(L,L'}  =  J^(LnL')  the 
length  of  the  overlap,  and  the  shorter  line  of  L,  and 

Lj\ot 

where  is  the  projection  of  a  line  onto  the  v-axis;  or 

&(3>„(L,),L^)^  ^ 


where  Lf  is  assumed  to  be  the  shorter  line  and  ^.(L,) 
the  orthogonal  projection  of  L,  onto  Lj. 

(3)  L,  and  Lj  are  relatively  dose. 

The  above  conditions  are  based  on  the  perceptual 
organization  rules  of  proximity,  parallelism,  and  simi¬ 
larity.  Condition  (1)  requires  that  two  line  segments 
have  a  similar  length.  Two  lines  with  very  different 
lengths  are  unlikely  to  come  from  a  parallel  structure 
and  are  unlikely  to  be  perceptually  grouped.  Condi¬ 
tion  (2)  indicates  that  two  line  segments  in  a  PS  should 
have  a  sufficient  overlap.  With  the  defined  set  of 
overlapping  conditions,  lines  in  Figs  7(a)-(c)can  all  be 
properly  grouped.  Condition  (3)  restricts  the  relative 
distance  between  the  two  line  segments. 

The  line  segments  satisfying  the  above  conditions 
are  grouped  into  parallel  To  avoid  a  brute-force 
search,  line  segments  with  similar  orientations  are  first 
classified  into  dusters.  A  further  grouping  is  performed 
within  each  cluster  to  find  all  parallel  PS. 

S.2.  Polygon  primitive  structure 

To  extract  polygon  primitive  structures,  we  first  find 
closed  figure  (polygons)  formed  by  line  segments  using 
coterminations.  We  then  check  them  to  determine  if 
they  sausfy  the  conditions  of  a  polygon  PS.  Coter¬ 
minations  may  form  arbitrary  polygons.  Since  our 
purpose  of  finding  polygons  is  to  extract  significant 
image  structures  reflecting  man-made  objects,  we  want 
to  retain  closed  figures  with  “regular”  shapes  as  poly¬ 
gon  PS  and  to  discard  those  with  “irregular”  shapes. 
We  also  want  to  include  a  variety  of  polygons.  For 
example,  polygons  in  Fig.  8  will  be  retained  and  those 
in  Fig.  9  will  be  discarded. 

Formally,  we  define  a  polygon  PS  as  follows.  A 
closed  figure  P  is  a  polygon  PS  if: 


O  O 


Fig.  8.  These  polygons  are  accepted  as  polygon  PS. 
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Fig.  9.  These  polvgons  are  discarded. 


(1)  P  is  a  simple  polygon,  that  is,  the  edges  of  P  do 
not  intersect  among  themselves: 

(2)  P  is  relatively  compact:  ^lP)^S2^{<i(P)). 
where  ^i-)  is  deAned  as‘^^' 

area{P) 

is  the  convex  hull  of  P,  and  2: 1  is  a  constant; 

(3)  P  docs  not  have  many  cavities:  ^  6]n„  where 
n,  is  the  number  of  vertices  of  P  inside  "iflP),  the 
number  of  vertices  of  P  on  <(f(P),  and  (Sj  ^  1  is  a 
constant; 

(4)  the  number  of  edges  on  P  does  not  exceed  a  given 
threshold. 

Finding  closed  Agures  can  be  done  by  tracing  the 
lines  or  CT.  Starting  from  an  end  point  of  a  line  and 
going  along  the  given  lines  or  CT,  if  we  can  come  back 
to  the  starting  point,  a  closed  figure  is  found.  (Practi¬ 
cally,  as  in  extracting  CT,  we  cannot  expect  a  point 
connection  between  lines  and  to  trace  back  to  a  point. 
Instead,  we  should  consider  small  regions.)  However, 
the  time-complexity  of  this  kind  of  direct  search  is 
exponential.  Alternatively,  we  propose  a  novel  poly¬ 
nomial-time  algorithm  based  on  Graph  Theory.'**-^*' 
(Appendix  A  details  this  algorithm.)  The  approach 
Ands  a  set  of  independent  and  sufficient  closed  Agures 
using  the  CT  graph.  The  closed  Agures  are  then 
checked  with  the  conditions  for  the  polygon  PS.  The 
ones  satisfying  the  conditions  are  accepted  as  polygon 
PS  and  the  rest  are  discarded. 

5.3.  The  representation 

To  facilitate  higher  level  processing,  the  primitive 
structures,  including  parallel  PS  and  polygon  PS,  are 
represented  by  a  graph  called  the  PS  graph.  Each  node 
in  the  PS  graph  corresponds  to  one  PS.  Let  P,  =  'i(PSi) 
and  Pj  =  V(PSj)  be  convex  hulls  of  two  PS,  PS,  and 
PSj  (i  ^ j),  respectively.  Then,  the  two  nodes  corre¬ 
sponding  to  PS,  and  PSj  are  connected  in  the  PS  graph 
if  and  only  if  P,  and  Py  intersect,  that  is,  P,nPy  ^  0. 
Thus,  the  spatial  relationships  among  PS  are  represented 
explicitly  using  the  PS  graplt  For  example,  in  Fig  10(a), 
lines  1-3  form  PS,,  lines  4  and  5  form  PS2,  lines  3,4, 
and  6  form  PS3,  and  lines  7  and  8  form  PS*.  Figure 
10(b)  shows  the  convex  hulls  of  these  PS.  Three  convex 
hulls,  P,,  Pj,  and  Pj  overlap  each  other  and,  hence,  the 
corresponding  nodes  in  the  PS  graph  are  connected. 
Figure  10(c)  shows  the  PS  graph. 

The  PS  graph  can  oe  generalixed  if  we  define  a 
spatial  distance  between  two  PS.  For  example,  the  dis¬ 
tance  between  PS,  and  PS,  (i  #  j)  may  be  defined  as  the 
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Fig.  10-  Four  primitive  structures  and  their  PS  graph. 


minimum  distance  between  P,  and  Pj  if  p.nPj=:  0,  and 
as  zero  otherwise.  Then  the  two  nodes  corresponding 
to  PS,  and  PSj  are  connected  in  the  PS  graph  if  and 
oniy  if  the  distance  between  PS,  and  PSj  is  smaller  than 
or  equal  to  a  given  threshold.  The  advantage  of  such 
a  generalization  is  the  ability  to  include  closeness  in 
the  next  level  of  grouping.  The  price  to  be  paid  is  the 
increased  computation. 

&  SEGMENTATION 

The  segmentation  module  segments  the  given  image 
to  obtain  ROI  that  is  most  likely  to  enclose  man-made 
objects  or  a  substantial  part  of  the  man-made  objects. 
The  significance  of  this  segmentation  is  then  evaluated. 

6.1.  Grouping  related  PS 

As  discussed  earlier,  spatially  closed  PS  are  likely  to 
be  related  and  to  reflect  meaningful  structures,  whereas 
randomly  and  sparsely  distributed  PS  are  likely  to  be 
formed  accidentally.  Therefore,  for  man-made  object 
detection,  we  want  to  And  spatially  related  PS  and  to 
eliminate  accidental  PS.  Each  PS  occupies  a  region  in 
the  image.  For  example,  a  PS  containing  two  parallel 
lines  occupies  a  trapezoidal  region.  The  regions  of 
spatially  proximate  PS  tend  to  overlap,  touch,  or  be 
close,  whereas  the  regions  of  sparsely  located  PS  are 
isolated.  Hence,  we  can  segment  an  image  by  grouping 
PS  whose  regions  intersect  or  are  close.  The  largest 
image  region  occupied  by  the  grouped  PS  is  considered 
as  the  ROI  if  the  significance  of  this  region  exceeds  a 
given  threshold. 

An  algorithm  is  presented  to  achieve  the  segment¬ 
ation  using  the  PS  graph.  The  region  of  a  PS  is  deAned 
as  the  convex  hull  of  the  line  segments  in  the  PS. 
The  connectivity  of  the  nodes  in  the  PS  graph,  hence, 
reflects  the  intersection  and  the  closeness  of  the  regions 
of  the  PS.  Each  connected  component  with  more  than 
one  node  of  the  PS  graph  corresponds  to  a  set  of  PS 
whose  regions  intersect  or  are  close.  Therefore,  seg¬ 
menting  the  image  into  regions  that  include  those 
occupied  by  the  grouped  spatially  related  PS  and  a 
background  can  be  accomplished  by  finding  connected 
components  of  the  PS  graph. 
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Given  the  PS  graph,  we  first  find  the  connected 
components  of  the  graph.  For  each  component,  the 
corresponding  PS  are  grouped  into  a  larger  structure. 
This  structure  is  again  called  a  PS  for  the  simplicity  of 
the  algorithm  description.  A  new  PS  graph  is  then 
derived  for  the  new  PS  obtained  from  the  grouping. 
The  connected  components  of  the  graph  are  found, 
leading  to  further  grouping  of  the  PS.  This  process 
continues  until  no  PS  can  be  grouped.  Our  exper¬ 
iments  show  that  only  a  small  number  of  iterations  are 
actually  needed.  More  details  of  this  algorithm  are 
given  in  Appendix  B.  This  process  results  in  an  image 
segmentation  that  includes  regions  occupied  by  the 
grouped  PS  and  a  background.  Currently,  the  largest 
region  3t  of  the  grouped  PS  is  evaluated  for  considera¬ 
tion  as  the  ROI. 


6.2.  Significance  evaluation 

The  significance  of  the  region  ^  obtained  in  the 
segmentation  is  evaluated.  If  if  is  significant,  then  it  is 
the  ROI  most  likely  to  enclose  man-made  objects  or  a 
substantial  part  of  the  man-made  objects.  Naturally, 
the  area  of  J?  is  a  measure  of  significance.  If  the  area 
is  very  small,  jf  is  not  an  ROI.  Since  PS  are  likely  to 
arise  from  man-made  objects,  the  probability  of  PS 
falling  into  Jt  and  the  probability  of  lines  forming  PS 
in  Jf  are  other  measures  of  significance. 

Let  A  be  the  event  that  lines  form  a  PS,  B  be  the 
event  that  lines  fall  outside  Jl,  and  C  be  the  event  that 
lines  fall  inside  Ji.  Let  .V.Ng,  and  Nc  be  the  numbers 
of  lines  in  the  image,  outside  3i,  and  inside  respec¬ 
tively;  and  let  Af,  Mg,  and  M^  be  the  numbers  of  lines 
forming  PS  in  the  image,  outside  01,  and  inside 
respectively.  Then  the  probability  of  lines  forming  PS 
is  P{A)  =  M/N.  The  probability  of  a  PS  being  inside 
is  P{C\A)  =  Mc/M.  The  probability  of  lines  in  ^ 
forming  PS  is  P(A\C)  =  Mc/Nc.  The  probability  of 
lines  outside  forming  PS  is  PfAlB)  =  Since 

most  PS  are  expected  to  come  from  man-made  objects, 
and  since  an  ROI  should  be  the  region  occupied  by 
these  PS,  the  probability  that  a  PS  falls  into  the  ROI 
should  be  high  and  the  probability  that  lines  form  PS 
in  the  ROI  should  be  higher  than  that  outside  the  ROI. 
Therefore,  the  significance  of  M  is  determined  using  the 
following  three  conditions: 

fl)  P(CM)  2:  max  (0.5, ,s/,} +«; 

(2)  P{A\C):^PiA\B)  +  e; 

(3)  s/,  >  jsfo; 

where  =*  Area[dt\  Areailmage),  j0q  >s  a  threshold, 
the  e  is  a  small  number. 

Each  test  result  for  the  above  conditions  is  repres¬ 
ented  by  a  function  defined  in  the  interval  [0,1], 
indicating  how  well  jf  satisfies  the  condition.  The  first 
two  functions  p,  and  p,  are  of  the  same  form  represent¬ 
ing  the  first  two  conditions.  Let  the  left-hand  sides  of 
conditions  ( 1)  and  1 2l  be  /  and  the  ri^t-hand  sides  be  g. 
Then  p,,  i-  1,2,  are  defined  as 


Pr 


1  /SP 

/-(P-Al 


g-A^f^g 


[0  f<g-A 

where  A  is  a  number  controlling  the  drop  of  pi  from  1 
to  0.  The  last  function  is  defined  as 


P3  = 


1  j/,  2:  j/o 

j/,  <j/o 


The  significance  of  region  Jf  is  defined  as 

3 

Z  ‘"iPi 

I 

P=-T - 

j>  I 


where  aj,s  are  weights  representing  the  relative  im¬ 
portance  of  the  conditions.  Hence,  p  is  defined  in 
the  interval  [0, !].  We  select  the  equal  weights  in  the 
following  examples.  If  p  of  is  equal  or  close  to  1,  ^ 
is  the  ROI.  Otherwise,  01  is  rejected. 

In  summary,  we  have  presented  techniques  to  group 
low  level  image  features  hierarchically  into  an  ROI 
likely  to  enclose  man-made  objects  or  a  substantial 
part  of  the  man-made  objects.  These  techniques  include 
feature  extraction,  primitive  structure  formation,  and 
segmentation.  Some  of  these  techniques  are  novel  and 
others  present  unique  properties  and  advantages  com¬ 
pared  to  previous  related  work. 


7.  EXPEUMENTAL  RESULTS 

This  section  presents  several  examples  of  finding  ROI 
from  given  images.  In  these  examples,  the  similarity- 
angle  is  S  deg  and  6  ^  for  overlapping  conditions  is  60%. 
A  single  monochrome  image  is  used  for  each  of  the 
examples.  Ail  the  image  sizes  are  480  x  5 12.  The  simple 
PS  graph  is  used,  that  is,  the  intersection  of  convex 
hulls  instead  of  the  distance  determines  the  node  con¬ 
nectivity. 

Figure  1  shows  an  image  containing  an  electric 
transmission  tower.  The  image  is  processed  by  Burns’ 
algorithm  to  generate  line  segments  (Fig.  11).  These 
lines  enter  the  Feature  Extraction  module.  Very  short 
lines  (less  than  four  pixels  long)  are  eliminated;  lines 
likely  to  come  from  the  same  linear  structures  are 
merged  into  one  line;  and  coterminations  are  extracted. 
Figure  1 2  shows  the  lines  resulting  from  the  LS  extrac¬ 
tion.  Figure  13  shows  the  coterminations.  The  straight 
lines  and  CT  then  enter  the  PS  Formation  module  to 
identify  primitive  structures.  Figure  14  illustrates  the 
parallel  groups  thus  obtained,  and  Fig.  1 5  shows  poly¬ 
gon  PS.  Figure  16  shows  the  located  ROI  overlapped 
on  the  line  image  (Fig.  1 2).  The  significance  of  the  ROI 
is  1.0.  The  ROI  is  bounded  by  a  polygon  displayed 
with  a  bold  outline.  From  Fig.  16,  we  see  that  the  tower 


H.  Q.  Ll  and  J.  K.  AgGaKWal 


Detection  of  man-made  objects  in  non-urban  scenes 


847 


and  some  of  the  transmission  lines  are  property  included 
inside  the  polygon.  Other  transmission  lines  arc  not 
included  because  they  do  not  form  primitive  structures. 
Although  the  cluster  of  tree  leaves  is  very  close  to  the 
tower,  it  is  properly  separated  from  the  ROl.  We  can 
also  see  the  random  distribution  of  the  linear  edges 
detected  from  the  region  of  leaves. 

Figure  17  shows  another  tower  image  and  Fig  18  is 
the  located  ROl  overlaid  on  the  lines  after  LS  extrac¬ 
tion.  The  polygon  with  the  bold  outline  is  the  ROl. 
The  signiHcance  of  the  ROl  is  1 .0.  The  tower  and  most 
of  the  transmission  lines  arc  properly  enclosed  inside 
the  polygon. 

Figure  19  shows  an  image  of  a  concrete  bridge,  trees, 
and  a  river.  The  located  ROl  is  shown  in  Fig.  20.  The 
lines  in  Fig.  20  are  the  output  of  the  LS  Extraction. 
The  polygon  with  the  bold  outline  is  the  ROl.  The 
significance  of  the  ROl  is  1.0.  From  Figs  19  and  20,  we 
can  see  that  most  of  the  bridge  is  enclosed  inside  the 
ROl  except  for  its  far  end  near  the  image  boundary. 
Trees  and  the  river  are  properly  excluded  from  the 
ROl. 


Fig  16.  The  region  of  interest  for  the  tower  image. 
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Fig  IS.  Extnurted  polygon  PS  of  the  tower  image. 


Fig.  19.  An  image  with  a  bridge. 
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Fig  20.  The  region  of  interest  for  the  bridge  image. 


Fig.  21,  An  image  of  a  natural  scene. 


fie  22.  The  located  polygon  is  too  small  to  he  an  f '.  )l 
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Figure  2 1  is  a  natural  scene  of  trees,  sky,  and  a  pond. 
The  polygon  with  a  bold  outline  shown  in  Fig,  22  is 
the  largest  region  obtained  by  the  grouping  process. 
The  region  is  not  accepted  as  an  ROl  because  its 
significance  is  very  low  (p  =  0.37).  Therefore,  no  ROI 
is  found  in  this  image  In  fact,  there  are  no  large- 
man-made  objects  in  the  image. 

8.  CONCLLSION 

This  paper  presents  a  new  approach  for  the  detection 
of  large  man-made  objects  in  a  non-urban  area  using 
a  single  monochrome  image.  The  research  shows  how 
minimal  knowledge  and  information  about  the  domain 
can  best  be  used  for  a  computer  vision  task.  Prominent 
features  are  identified  that  distinguish  man-made 
objects  from  natural  objects.  We  propose  a  computa¬ 
tional  framework  to  locate  an  ROI  most  likely  to 
enclose  man-made  objects  or  a  substantial  part  of  the 
man-made  object.  The  framework  is  based  on  the 
principles  of  perceptual  organization.  Techniques  are 
developed  to  group  low  level  image  features  hierarchi¬ 
cally  into  the  ROT  Some  of  these  techniques  are  novel, 
while  others  present  unique  properties  and  advantages 
compared  with  previous  related  works. 

The  paper  presents  various  examples,  including 
images  of  different  kinds  of  man-made  objects  in 
complex  backgrounds  as  well  as  an  image  of  a  natural 
scene  without  man-made  objects,  to  show  the  approach’s 
effectiveness.  As  the  examples  show,  this  approach  is 
capable  of  locating  a  useful  ROI  in  complex  real 
images.  Hence,  the  search  space  is  reduced  from  the 
whole  image  to  the  ROI.  Further  analysis  of  the  ROI 
may  lead  to  the  identification  of  an  object  or  to  the 
rejection  of  the  existence  of  man-made  objects  in  the 
scene.  Therefore,  this  technique  of  locating  the  ROI 
may  be  used  for  an  initial  screening  of  a  large  number 
of  images  for  automatic  object  recognition  or  for  a 
human-machine  system.  For  an  automatic  system, 
when  specific  object  classes  are  given  and  models  are 
established,  primitive  structures  composing  the  ROI 
can  be  matched  to  object  models  instead  of  matching 
individual  features.  This  will  considerably  reduce  the 
search  space  for  matching  since  more  constraints  are 
applied.  For  a  human-machine  system,  the  ROI  can 
be  used  as  a  focus-of-attention  for  human  expertise  to 
further  examine  the  image. 

We  have  currently  used  straight  lines,  cotermina¬ 
tions,  parallel  PS,  and  polygon  PS  in  the  grouping 
process.  Hence,  man-made  objects  without  such  struc¬ 
tures  are  not  detected.  Future  research  will  consider 
features  and  primitive  structures  related  to  curves,  in 
addition  to  straight  lines. 
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APPENDIX  A.  FINDING  CLOSED  HCURES 

In  order  to  extract  polygon  primitive  structures,  we  need 
to  find  closed  figures  (polygons)  formed  by  line  segments.  This 
appendix  denves  a  method  of  finding  a  let  of  independent 
and  sufficient  closed  figures  using  a  CT  graph.  From  the 
definition  of  the  CT  graph,  it  is  clear  that  a  one-to-one  cor¬ 
respondence  exists  between  the  closed  figures  of  line  segments 
in  an  image  and  ihecircuir.s  in  the  CT  graph.  Thus,  the  problem 
of  finding  closed  figures  can  be  solved  by  finding  circuits  in 
the  graph.  There  may  be  many  closed  figures  in  an  image,  and 
some  of  them  may  be  redundant.  For  example,  two  polygons 
sharing  one  common  line  form  a  third  polygon,  which  is 
dependent  on  the  other  two.  Since  our  objective  is  to  find 
primitive  structures,  we  want  to  find  a  set  of  closed  figures 
that  have  a  simple  form  (that  is,  the  number  of  edges  is  small), 
are  independent  (to  avoid  redundancy),  and  are  sufficient  (to 
be  able  to  generate  all  closed  figures).  According  to  graph 
theory,  to  satisfy  the  latter  two  requirements,  we  just  need  to 
find  a  set  of  fundamental  circuits  in  the  CT  graph. 


The  set  offundamental  circuits  is  not  unique.  Correspond¬ 
ing  to  each  spanning  tree  of  a  graph,  there  is  a  set  of  funda¬ 
mental  circuits.  To  meet  the  simplicity  requirement,  we  want 
to  find  a  set  of  fundamental  circuits  such  that  the  number  of 
edges  in  the  set  is  small.  This  may  be  formulated  as  a  combi¬ 
natorial  optimization  problem;  given  a  graph,  find  a  spanning 
tree  such  that  the  number  of  edges  in  the  set  of  fundamental 
circuits  corresponding  to  this  tree  is  minimized.  We  can 
require  the  total  number  of  edges  to  be  minimized  or  the 
number  of  edges  in  the  largest  circuit  to  be  minimized.  If  the 
graph  is  plana,  and  if  the  graph  keeps  the  geometry  of  the 
line  segments  in  the  image,  the  optimal  solution  is  the  set  of 
meshes  (faces).  However,  a  CT  graph  may  be  non-planar 
because  line  segments  may  cross  each  other  in  an  image  To 
our  knowledge,  there  is  no  efficient  algorithm  available  to  the 
above  optimization  problem  for  a  non-planar  graph.  Alter¬ 
natively.  we  develop  a  heuristic  solution,  which  may  not  be 
mathematically  optimal  but  is  efficient. 

The  basic  idea  is  to  find  the  “bushiest"  spanning  tree.  For 
example.  Fig.  A1  shows  a  graph  and  two  ofthe  spanning  trees 
of  the  graph.  All  the  fundamental  circuits  corresponding  to 
tree  (a)  have  three  edges  whereas  those  corresponding  to  tree 
(b)  have  3, 4, 5, 6. 7,  and  6  edges,  respectively.  In  ihis  example, 
tree  (a)  is  the  bushiest  spanning  tree  among  all  the  spanning 
trees,  and  the  corresponding  fundamental  circuits  have  the 
minimum  number  of  edges.  To  formally  define  the  bushiest 
spanning  tree,  we  first  define  the  weight  of  an  edge  in  the  graph. 

Let  G  =  ( F,  £)  be  a  CT  g’-aph,  where  K  and  £  are  the  set  of 
vertices  and  the  set  of  edges  of  G,  respectively.  Let  e,jeE  be 
an  edge  connecting  vertices  ty,  tye  V.  Then  the  weight  of  Cij  is 
defined  as;  wlcy)  =  d(ty)  +  d(Vj),  where  d(  )  is  the  degree  of  a 
vertex,  that  is.  the  number  of  edges  incident  with  the  vertex. 
The  weight  of  a  spanning  tree  .f  is  the  sum  of  the  weights  of 
all  the  branches  in  J'.  A  spanning  tree  with  the  largest  weight 
in  G  is  called  the  maximal  spanning  tree.  We  define  the 
bushiest  spanning  tree  as  the  maximal  spanning  tree.  Let  us 
look  at  the  example  in  Fig.  A1  again.  The  numbers  on  each 
edge  of  the  graph  indicate  the  w  eights  of  the  edges.  Obviously, 
tree  (a)  is  the  maximal  spanning  tree  and  is  the  bushiest  one. 
The  maximal  spanning  tree  can  be  efficiently  found  by  slightly 
modifying  a  minimal  spanning  tree  algorithm.'^’'  Given  a 
spanning  tree  of  a  graph,  the  set  of  fundamental  circuits 
corresjxinding  to  this  tree  can  be  found  in  polynomial¬ 
time.'*” 

The  procedure  for  finding  a  set  of  closed  figures  satisfying 
our  requirements  is  now  straightforward.  After  extracting 
CT,  the  CT  graph  is  established  that  is  represented  by  an 
adjacency  matrix.  We  then  find  the  connected  components  of 
the  graph  and  process  each  subgraph  corresponding  to  each 
component  separately,  using  the  following  steps.  First  the 
weights  for  all  the  edges  in  the  subgraph  are  calculated.  Then 
the  maximal  spanning  tree  of  the  subgraph  is  found.  Finally, 
we  find  the  fundamental  circuits  corresponding  to  the  span¬ 
ning  tree.  Each  fundamental  circuit  represents  a  closed  figure 
in  the  image,  where  edges  on  this  circuit  correspond  to  line 
segments  on  the  closed  figure. 

The  above  procedure  is  efficient.  Let  n  be  the  number  of 
vertices  in  G.  Then  the  time  for  finding  connected  components 
is  0(n).  Let  G,-  be  a  connected  component  of  C,  and  let  n,  and 
nij  be  the  number  of  vertices  and  the  number  of  edges  in  G,. 
respectively.  The  time  for  finding  the  edge  weights  is  0(m,), 
and  that  for  finding  the  maximal  spanning  tree  is  0(n*). 
Finding  fundamental  circuits  requires  Ofnf  i-timc  for  a  sparse 
graph  and  0(fl?Hime  generally.  (A  CT  graph  is  most  likely  to 
be  a  sparse  graph.)  Therefore,  our  algorithm  is  usually- 
executed  in  0(n*Flime  and.  at  worst,  in  0(ir' I-timc.  It  is,  hence, 
a  polynomial-time  algorithm  while  the  direct  search  takes 
exponential-time.  Another  advantage  of  using  a  CT  graph  to 
find  closed  figures  is  that  the  existence  of  closed  figures  among 
a  given  set  of  line  segments  can  be  readily  determined.  In  fact, 
there  are  circuits  in  a  graph  only  if  the  number  of  edges  is 
greater  than  or  equal  to  the  number  of  vertices  in  the  graph. 
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(•) 


Fig.  Ai.  A  graph  and  two  of  iu  spanning  trees. 


(b) 


appendix  a  GROCPING  PRIMITIVE  STRUCTURES 

This  appendix  presents  an  algorithm  that  groups  PS  using 

the  PS  graph.  Let  =  { P°  =  i  =  1. 2 . rio}  be  the 

set  of  convex  hulls  of  the  Let  be  the  function 

that  maps  the  convex  hull  set  9  to  the  PS  graph.  Let  = 
( <  ■®,  if”)  =  d>(  be  the  PS  graph  of  the  PS  in  the  image,  where 
i is  the  set  of  vertices,  and  the  set  of  links.  An  algorithm 
described  in  reference  (36)  is  applied  to  find  the  connected 
components  of  a  graph. 

Let  ;  irf ,  i  =  1, 2, . . . ,  mo}  be  the  set  of  connected  com¬ 
ponents  of'#”.  Then  and  <()  ~  ‘f*’”)  £  is  either  one 

convex  hull  or  a  set  of  convex  hulls  that  intersect  or  are  close. 
In  the  latter  situation,  the  PS  whose  regions  belong  to 
4>  ■  '(4'?)  are  grouped  into  a  larger  structure  PS*.  The  region 
of  the  PS,'  is 


p;=«(ps/)-sf(U,^.,.,,^pp. 

For  example,  in  Fig.  10,  the  regions  (convex  hulls)  of  the 
three  PS  intersect.  Figure  10(d)  shows  the  region  of  the 
grouped  1%  when  these  three  PS  are  grouped.  If  Ik”  has  only 
one  vertex,  that  is,  ^ '*(-#?)  represents  one  PS,  PSj,  then 
PSj*  =>  PS^,  and  P/  =  P®.  All  the  connected  components  of 
are  thus  processed  to  produce  possible  groupings  of  PS  and, 
hence,  a  segmentation  of  the  image. 

A  new  PS  graph  9*  «=  (f'*,  #' )  =  )  is  then  established. 

where  ^'  =  {P,*  ='if(PS?),  i  =  l,2 . n,}.  The  connected 

components  of  9*  are  found,  leading  to  further  grouping  of 
PS.  This  process  continues  until  no  PS  can  be  further 
grouped.  That  is,  the  iteration  stops  at  #‘ =  (»  *,<?*)  with 
#*  =  0.  Our  experiments  show  that  only  a  small  number  of 
iterations  is  actually  needed. 
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ABSTRACT 

This  paper  describes  a  method  to  perform  distance 
measurements  inside  buildings  using  a  single  video  cam¬ 
era  mounted  on  a  mobile  robot.  This  algorithm  is  de¬ 
signed  specifically  to  create  architectural  floor  plans  with 
accurate  measurements.  Several  issues  associated  with  ar¬ 
chitectural  surveying  are  discussed  in  detail,  such  as  the 
visibility  of  features  and  the  accuracy  in  calibration.  Re- 
aults  of  the  edge  reconstruction  in  an  indoor  scene  are 
presented  and  compared  to  an  architect’s  drawing  of  the 
building.  Applications  of  this  method  include  the  verifi¬ 
cation  of  constructed  buildings,  the  update  or  creation  of 
civil  engineering  CAD  models,  and  the  acquisition  of  data 
for  architectural  graphics  simulations. 


INTRODUCTION 

The  perception  system  of  most  mobile  robots  is  geared 
towards  navigation,  where  qualitative  information  is  more 
important  than  a  high  accuracy.  We  propose  here  a  differ¬ 
ent  task:  the  automatic  metrical  surveying  of  a  building 
using  a  single  camera  mounted  on  a  mobile  robot.  The 
robot  provides  a  sequence  of  images  registered  with  odo- 
metric  estimates  of  the  motion  between  frames.  The  vi¬ 
sion  algorithm  reconstructs  the  3-D  scene  by  using  the 
known  motion  of  the  camera.  In  order  to  extract  the  most 
useful  features  for  reconstruction,  we  concentrate  on  seg¬ 
ments  that  have  a  particular  orientation  in  the  3-D  scene. 
In  most  buildings,  there  are  three  prominent  3-D  orien¬ 
tations:  the  vertical  and  two  horizontal  3-D  orientations 
perpendicular  to  each  other.  This  assumption  is  often  used 
with  indoor  scenes  [I,  2,  7,  8,  9],  and  wiU  hold  for  the  rest 
of  this  paper. 

We  begin  by  describing  algorithms  to  extract  useful 
line  segments  from  each  image  and  to  reconstruct  the  3-D 
scene.  We  then  discuss  several  issues  of  particular  impor¬ 
tance  for  practical  applications  in  architectural  surveying, 
such  as  the  visibiUty  of  segments  In  typical  indoor  scenes. 
The  following  section  concentrates  on  accuracy  issues  and 
explains  the  calibration  procedure  for  the  camera  and  the 
robot.  Finally,  we  provide  a  comparison  between  recon¬ 
structed  edges  and  the  corresponding  architectural  floor 
plan. 


a-D  RECONSTRUCTION 

Line  segments  corresponding  to  particular  orienta¬ 
tions  in  3-D  are  extracted  from  each  image  using  a  special 
line  detector  based  on  vanishing  pmnts  (4].  The  detectioa 
and  interpretation  process  provides  a  3-D  orientation  hy¬ 
pothesis  for  each  2-0  segment.  Vertical  segments  are  of 
particular  importance  for  floor  pitas,  but  other  particu¬ 
lar  3-D  orientations  can  be  processed  as  well.  Segments 
that  do  not  have  any  of  the  predefined  3-D  orientations 
are  ignored. 

The  3-D  potilion  of  segments  is  estimated  from  a  se¬ 
quence  of  images  using  a  method  based  on  Kalman  filter¬ 
ing  (5,  6].  The  position  of  each  3-D  segment  is  represented 
by  an  estimate  and  an  associated  Gaussian  uncertainty. 
The  floor  plan  can  then  be  constructed  from  these  esti¬ 
mates.  in  our  approach,  each  3-D  orientation  is  treated 
separately.  This  is  possible  since  the  line  detector  indi¬ 
cates  the  3-D  orientation  of  the  segments.  Matching  it 
therefore  simplified,  as  is  the  complexity  of  prediction  and 
update:  all  the  computation  it  done  in  planes  perpendic¬ 
ular  to  each  one  of  the  predefined  3-D  orientations. 

APPLICATION  TO  ARCHITECTURE 

The  application  of  our  reconstruction  algorithms  to 
architectural  surveying  imposes  a  few  practical  constraints. 

Field  of  view:  In  a  typical  indoor  scene,  there  are 
relatively  few  airhittcturotly  significant  features,  such  as 
room  corners  and  doorways.  Those  features  are  usually 
far  apart.  However,  the  robot  should  tee  as  many  features 
as  possible  at  any  time.  For  this  reason,  we  equipped  the 
camera  with  a  wide-angle  lent  (6  mm  focal  distance  for 
a  2/3  inch  CCD  camera).  The  advantage  is  that  more 
interesting  edges  can  be  tracked  at  once.  In  long  and  nar¬ 
row  corridors,  segments  can  be  observed  not  only  from  a 
distance,  but  also  as  the  robot  passes  by  them.  Those 
segments  are  therefore  acquired  under  very  different  an¬ 
gles,  thus  reducing  the  uncertainty  in  their  reconstruction. 
With  a  wide-angle  lens,  the  forward-looking  camera  can 
still  see  edges  lying  almost  along  a  perpendicular  to  the 
direction  of  motion.  For  those  edges,  the  precision  of  re¬ 
construction  is  similar  to  that  of  a  stercovision  system  with 
widely  spaced  cameras.  The  accuracy  is  then  very  high, 
assuming  that  the  exact  motion  of  the  camera  is  known. 

The  drawback  to  using  a  wide-angle  lens  is  the  asso¬ 
ciated  barrel  distortion  (see  Figure  1).  In  the  next  section, 
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we  wiU  explain  bow  this  distortion  is  calibrated  and  cor¬ 
rected. 

Motion  determination;  We  have  assumed  so  far 
that  the  exact  motion  of  the  camera  was  given  to  the  3-0 
reconstruction  algorithm.  Although  the  robot's  odometers 
provide  a  position  and  a  heading,  these  measurements  can¬ 
not  be  used  alone,  since  odometers  drift  without  bounds. 
Odoraetry  may  be  adequate  for  estimating  the  motion  be¬ 
tween  a  few  images,  but  it  is  insufficient  for  the  long  se¬ 
quences  necessary  in  mapping  an  entire  building. 

The  robot  is  equipped  with  an  odometer  on  its  right 
and  left  driving  wheels.  In  practice,  translation  measure¬ 
ments  are  very  good  because  they  are  derived  from  the  av¬ 
erage  of  the  two  odometers.  However,  rotations  (changes 
in  robot  heading)  drift  much  faster  because  they  rely  on 
the  difference  between  the  two  odometers.  For  this  reason, 
we  correct  the  odometric  heading  with  vision  every  time  an 
image  is  processed.  By  extracting  the  exact  position  of  the 
vanishing  points  in  the  image,  the  orientation  of  the  cam¬ 
era  with  respect  to  the  scene  is  computed  precisely.  The 
heading  of  the  robot  is  estimated,  as  well  as  the  roll  and 
pitch  errors  due  to  uneven  floor  surface.  This  technique 
prevents  the  beading  from  drifting  without  bounds.  The 
position  of  the  robot,  obtained  by  integrating  small  dis¬ 
placements,  can  still  drift  because  of  errors  in  translations, 
but  they  build  up  much  slower  than  errors  in  rotations. 

Using  this  technique,  the  robot's  positional  error  was 
reduced  by  several  orders  of  magnitude  after  a  rectangular 
trajectory  of  12S  meters  around  a  building  floor  (see  [5] 
for  details). 

ACCURACY  AND  CALIBRATION 

For  any  practical  application  to  architectural  survey¬ 
ing,  the  precision  of  measurement  is  paramount.  An  ac¬ 
curate  calibration  procedure  is  necessary.  In  this  section, 
we  describe  the  calibration  procedure  for  the  intrinsic  pa¬ 
rameters  of  the  camera,  including  the  barrel  distortion,  as 
well  as  for  the  extrinsic  parameters  (camera-robot  rela¬ 
tionship). 

The  Optical  axis:  We  first  determine  the  intersec¬ 
tion  of  the  image  plane  and  the  optical  axis,  expressed  in 
pixel  units  in  the  coordinate  system  of  the  frame  grabber. 
The  method  we  use  is  optical.  After  closing  the  iris  of  the 
camera,  a  low-power  laser  beam  is  shone  through  a  perfo¬ 
rated  white  screen  into  the  center  of  the  lens.  The  beam 
is  partially  reflected  by  the  lens  onto  the  screen.  The  goal 
is  to  align  the  laser  beam  and  the  optical  axis  of  the  lens. 
By  carefully  adjusting  the  pan  and  tilt  of  both  the  laser 
and  the  caimera,  the  beam  is  made  to  reflect  onto  itself. 
At  that  point,  an  attenuating  filter  is  placed  in  the  beam 
and  the  iris  is  slightly  opened.  A  bright  spot  becomes 
visible  in  the  image,  indicating  the  location  of  the  optical 
axis  on  the  retina.  This  method  is  not  recommended  with 
non-CCD  cameras. 

The  projection  model:  The  next  calibration  step 
consists  of  identifying  the  focal  length  and  the  distortion 
of  the  lens.  The  distortion  is  zero  on  the  optical  axis,  and 
non-negligible  near  the  borders  of  the  image  rectangle.  As 
a  simplification,  we  assume  that  the  image  is  formed  in  two 
steps.  Firstly,  the  scene  is  projected  onto  the  image  plane 
by  a  standard  pinhole  perspective  projection.  Secondly, 
the  image  is  distorted  in  the  image  plane  by  a  2-D  to  2-D 
function.  This  assumption  means  that  the  bartei  distor¬ 
tion  docs  not  depend  on  the  distance  between  the  lens  and 


the  scene,  and  holds  if  the  diameter  of  the  lens  is  not  too 
large. 

The  standard  perspective  projection  is  completely  de¬ 
fined  by  the  location  of  the  optical  axis  in  the  image  (de¬ 
termined  in  the  previous  subsection)  and  the  focal  length 
(to  be  determined  later  for  points  near  the  optical  axis, 
where  the  distortion  is  ne^gible).  The  second  step,  a  2-U 
distortion  in  the  image  plane  will  then  modeled  by  cali¬ 
bration.  This  model  is  used  to  correct  the  distortion  in 
digitixed  images  [3]. 


Figure  1:  The  calibration  pattern  distorted  by  the 
wide-angle  lens 


The  calibration  pattern:  We  first  measure  the  lo¬ 
cation  of  a  set  of  data  points  covering  the  image.  To  obtain 
the  data  points,  we  place  the  camera  in  front  of  a  large 
calibration  pattern  on  a  wall  (see  Figure  1).  The  pat- 
tern  consists  of  a  grid  of  black  dots  placed  every  10  cm. 
For  best  results,  the  distance  between  the  camera  and  the 
pattern  should  approximate  the  typical  distance  to  objects 
when  the  robot  is  in  operation.  In  practice,  this  leads  to  a 
calibration  pattern  as  large  as  the  scene  itself,  if  possible. 
The  size  of  our  pattern  is  2.4  x  1.8  meters. 

We  first  need  to  ensure  that  the  image  plane  is  paral¬ 
lel  to  the  pattern,  so  that  the  perspective  projection  will 
not  interfere  with  the  2-D  distortion  in  the  image  plane. 
To  achieve  this  constraint,  we  place  a  mirror  flat  on  the 
pattern.  We  then  adjust  the  pan  and  tilt  of  the  camera 
until  the  reflection  of  the  camera  by  the  mirror  coincides 
with  the  location  of  the  optical  center  in  the  image  plane. 
To  facilitate  this  operation,  we  digitally  superimpose  cross 
hairs  centered  on  the  the  optical  axis  over  the  live  image 
from  the  camera.  Using  the  cross  hairs,  any  roll  in  the 
camera’s  orientation  is  also  eliminated  at  this  stage.  Fi¬ 
nally,  the  image  of  the  pattern  is  digitized  and  the  data 
points  are  automatically  extracted. 

The  focal  length:  The  next  intrinsic  parameter  to 
be  determined  is  the  focal  length,  expressed  in  horizontal 
and  vertical  pixel  units.  The  focal  length  is  computed  for 
points  lying  close  to  the  center  of  the  image,  where  the 
barrel  distortion  is  negligible.  The  distance  D  between 
the  camera  and  the  calibration  pattern  is  physicaUy  mea¬ 
sured,  and  expressed  in  millimeters.  Let  L  be  the  distance 
(in  millimeters)  between  two  points  close  to  the  center  of 


the  pattern,  and  I  the  distance  (in  pixels)  hetween  their 
perspective  projections.  Neglecting  the  barrel  distortion, 
the  focal  length  /  is  given  in  pixels  by  /  =  IDf L.  Since 
the  image  pixel?  arc  not  neces.sary  .square,  the  focal  length 
needs  to  be  expressed  both  in  horizontal  and  vertical  pixel 
units. 

Barrel  distortion:  At  this  point,  all  the  parame¬ 
ters  for  a  conventional  perspective  projection  have  been 
determined.  However,  most  points  of  the  pattern  do  not 
project  exactly  to  their  theoretical  location  or  the  image 
plane  because  of  the  barrel  distortion.  We  automatically 
measure  the  resulting  2-D  displacement  vectors  in  the  im¬ 
age  plane,  as  the  difference  between  the  theoretical  and 
observed  coordinates  of  projected  dots.  The  2-D  distor¬ 
tion  function  is  estimated  by  linear  interpolation  between 
data  points  [3].  Digitized  images  can  then  be  corrected  to 
eliminate  barrel  distortion.  The  intensity  of  every  pixel  in 
the  corrected  image  is  found  by  looking  up  the  intensity 
at  the  appropriate  location  in  the  original  distorted  image. 
In  order  to  speed-up  the  distortion  correction,  a  look-up 
table  is  computed  at  the  time  of  calibration  to  translate 
corrected  coordinates  into  raw  coordinates. 

Extrinsic  parameters:  We  then  measure  the  rota¬ 
tion  and  translation  of  the  camera  relative  to  the  robot's 
coordinate  system.  The  translations  are  physically  mea¬ 
sured.  The  roll  is  forced  to  zero  by  aligning  cross  hairs 
digitally  superimposed  on  the  image  to  the  horizontal  axis 
of  the  calibration  pattern,  as  described  previously.  Next, 
the  pan  and  tilt  are  adjusted  until  the  robot  is  able  to 
back  away  from  the  calibration  pattern  in  a  straight  line 
while  keeping  the  same  point  of  the  pattern  under  the 
cross  hairs  of  the  optical  axis.  This  process  results  in  zero 
pan  and  tilt.  If  a  non-zero  pan  or  tilt  is  desired,  the  cam¬ 
era  is  rotated  from  the  zero-angle  position  by  a  controlled 
angle. 

RESULTS 

Figure  2  shows  a  short  sequence  of  images  acquired 
by  the  robot.  The  corresponding  line  segment  images  arc 
given  in  Figure  3.  The  result  of  the  reconstruction  of  verti¬ 
cal  edges  using  these  four  images  is  presented  in  Figure  4, 
and  compared  to  the  architectural  floor  plan  of  the  build¬ 
ing.  Vertical  edges  are  indicated  by  small  crosses.  The 
last  position  of  the  robot  is  represented  by  a  small  square 
in  the  middle  of  the  corridor.  The  grid  has  a  one-meter 
spacing.  The  door  in  the  top  right  corner  was  closed,  and 
some  segments  on  it  are  reconstructed.  A  few  segments 
with  no  architectural  interest  arc  picked  up.  For  example, 
two  of  them  lie  on  the  wall  at  the  bottom  of  the  figure. 
The  segments  in  the  top-left  corner  of  the  figure  appear  to 
have  been  shifted  to  the  right.  After  measuring  the  actual 
dimensions  in  the  scene,  we  found  that  the  corner  of  the 
corridor  was  built  apprjximately  5  cm  to  the  right  of  the 
position  shown  on  the  architictural  drawing.  The  robot 
therefore  correctly  indicated  this  discrepancy,  although  it 
overestimated  it  slightly. 

CONCLUSION 

We  proposed  a  method  for  the  task  of  surveying  build¬ 
ings  using  mobile  robots.  We  addressed  several  important 
issues  resulting  from  the  particular  requirements  of  archi¬ 
tectural  applications,  and  we  presented  results  obtained  in 
the  reconstruction  of  indoor  scenes.  Finally,  we  compared 


those  results  to  the  architect's  drawing  of  the  floor  plan 
Improvements  to  our  approach  are  possible  in  several  a>- 
eas.  to  further  increase  the  precision  of  measurements  and 
to  produce  C.AD-like  building  maps.  We  believe  that  au¬ 
tomatic  architectural  surveying  using  mobile  robots  is  a 
feasible  task  in  the  fairly  near  future.  .Applications  will  in¬ 
clude  the  verification  of  constructed  buildings,  the  update 
or  creation  of  civil  engineering  CAD  models,  the  acqui¬ 
sition  of  data  for  architectural  graphics  simulations,  and 
the  creation  of  maps  for  use  by  other  robots. 
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ABSTRACT 

The  key  issue  in  motion  estimation  and  tracking  an 
object  over  a  sequence  of  images  is  establishing  correspon¬ 
dence  between  the  features  of  the  object  in  the  different 
images  of  the  sequence.  For  range  image  sequences,  this 
problem  translates  into  finding  a  match  between  the  sur¬ 
face  segments  in  a  pair  of  range  images  of  the  scene.  This 
paper  considers  the  problem  of  establishing  correspon¬ 
dences  between  surfaces  in  a  sequence  of  range  images.  We 
present  a  novel  procedure  for  finding  correspondence  and 
show  the  results  on  real  range  image  sequences.  A  graph 
search  procedure  forms  the  basis  for  the  algorithm  that 
computes  the  correspondence  between  surfaces.  The  solu¬ 
tion  uses  geometrical  and  topological  information  derived 
from  the  scenes  to  direct  the  search  procedure.  Fundamen- 
•al  to  our  strategy  to  match  features  over  a  sequence  of 
mge  images  is  a  hypergraph  representation  of  the  scenes. 
Two  scenes  arc  modeled  as  hypergraphs  and  the  hyper¬ 
edges  are  matched  using  a  sub-graph  isomorphism  algo¬ 
rithm.  The  hierarchical  representation  of  hypergraphs  not 
only  reduces  the  search  space  significantly,  but  ^so  facil¬ 
itates  the  encoding  of  the  topological  and  geometrical  in¬ 
formation.  Further,  we  present  a  sub-hypergraph  isomor¬ 
phism  procedure  to  establish  the  correspondences  between 
the  surface  patches  and  demonstrate  the  algorithm  on  dif¬ 
ferent  types  of  real  range  image  sequences.  We  present  re¬ 
sults  that  show  thi .  the  algorithm  is  robust  and  performs 
well  in  presence  of  occlusions  and  incorrect  segmentations. 

INTRODUCTION 

The  key  issue  in  motion  estimation  and  tracking  an 
object  over  a  sequence  of  images  is  establishing  correspon¬ 
dence  between  the  features  of  the  object  in  the  different 
images  of  the  sequence.  In  this  paper  we  deal  with  the 
tracking  of  objects  in  a  sequence  of  range  images  to  esti¬ 
mate  the  motion  of  the  camera  (range  sensor)  in  the  envi¬ 
ronment.  Range  images  sense  the  surface  of  the  objects, 
so  it  is  natural  to  use  surface  segments  as  the  features  of 
interest;  this  translates  the  tracking  of  objects  into  finding 
a  match  between  the  surface  segments  in  a  pair  of  range 
images  of  the  scene.  This  paper  considers  the  finding  of 
correspondences  between  surfaces  in  a  sequence  of  range 
images.  Finding  correspondence  or  a  match  between  fea¬ 
tures  is  not  isolated  to  object  tracking,  but  is  also  central 
to  other  computer  vision  tasks  including  navigation,  ab¬ 
ject  recognition,  target  tracking,  and  map  building.  We 
present  a  novel  procedure  for  establishing  correspondence 
and  show  the  results  on  real  range  image  sequences. 


A  graph  search  procedure  forms  the  basis  for  the  algo¬ 
rithm  that  computes  the  correspondence  between  surfaces. 
The  solution  uses  geometrical  and  topological  information 
derived  from  the  scenes  to  direct  the  search  procedure.  In 
general,  the  input  to  the  matching  algorithm  is  the  output 
from  a  segmentation  algorithm  that  partitions  the  image 
into  surface  segments.  The  performance  of  the  matching 
depends  greatly  on  the  results  of  the  segmentation  algo¬ 
rithms.  Incorrect  segmentation  causes  poor  estimation  of 
the  surface  parameters  and  affects  the  performance  of  the 
matching  algorithm.  We  address  this  issue  and  obtain  a 
solution  that  is  robust  and  able  to  handle  occlusions  of 
surfaces,  noise  in  data,  and  incorrect  segmentation  from  a 
segmentation  algorithm.  In  the  present  impiementation, 
we  assume  that  the  images  have  planar,  cylindrical  and 
conical  surfaces;  however,  the  procedure  is  genera!  enough 
to  be  extended  to  other  surface  classes. 

The  question  of  finding  correspondences  between  fea¬ 
tures  has  been  studied  extensively  (see  [1,  3,  4,  5j)  but, 
most  of  these  approaches  deal  with  matching  a  scene  to  a 
model  of  the  object.  The  fundamental  difference  between 
modcl-to-scene  matching  and  scene- lo-scene  matching  is 
that  in  the  former,  the  model  description  of  the  object  is 
complete,  and  to  that  we  match  the  incomplete  descrip¬ 
tion  of  the  object  obtained  from  the  scene.  However,  in 
the  case  of  scene- to- scene  matching,  both  descriptions  of 
the  object  are  incomplete  and  we  must  find  a  match  be¬ 
tween  two  incomplete  descriptions.  By  incomplete,  we 
mean  that  all  the  features  are  not  present  in  the  descrip¬ 
tion  of  the  object  because  of  occlusions  and  sensor  errors. 
This  difference  makes  it  impossible  to  use  the  strategies 
obtmned  for  object  recognition  in  the  domain  of  object 
tracking;  new  strategies  based  on  the  constraints  of  the 
problem  have  to  be  designed. 

Fundamental  to  our  strategy  to  match  features  over 
a  sequence  of  range  images  is  a  hypergraph  representa¬ 
tion  of  the  scenes.  The  two  scenes  a.e  modeled  as  hyper¬ 
graphs  and  the  hyperedges  are  matched  using  a  sub-graph 
isomorphism  algorithm.  To  reduce  the  complexity  of  the 
matching  task,  heuristics  derived  from  the  topological  and 
the  geometrical  information  available  from  the  scene  are 
used  to  direct  the  search.  The  hierarchical  representation 
of  hypergraphs  not  only  reduces  the  search  space  signifi¬ 
cantly,  but  also  facilitates  the  encoding  of  the  topological 
and  geometrical  information.  Hyperedges  are  formed  by 
grouping  the  surface  features,  which  reduces  the  search 
space.  Using  a  priori  knowledge  arising  out  of  the  physi¬ 
cal  constraints  of  laser  scanning,  a  fast  matching  algorithm 
is  designed. 
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HYPERGRAPH  REPRESENTATION 


Hypergraphs  are  generalizations  of  graphs.  The  edge 
is  generalized  as  a  hyperedge,  where  a  set  of  vertices  forms 
the  hyperedge,  instead  of  just  two  vertices  forming  the 
edge.  The  group  of  vertices  forming  the  hyperedge  may 
share  some  common  property.  Hypergraphs  have  been 
used  earlier  in  vision  and  robotics  applications  {11,  12], 
but  have  not  found  widespread  usefulness.  We  present 
a  new  definition  of  the  hyperedge  and  a  novel  method  for 
constructing  the  hypergraphs  that  makes  it  a  powerful  tool 
for  vision  applications. 

Attributed  hypergraphs  are  a  concise  way  of  repre¬ 
senting  objects  such  that  both  quantitative  and  qualitative 
information  are  encoded  in  the  representation.  Formally; 

Definition  1  The  Hypergraph  ft)  is  defined  as  an  ordered 
pair  H  =  (X,E)  where  X  =  {ri, zj, •  s^n}  «s  o 
set  of  attributed  vertices  and  E  =  {ei.ej,  •  •  -.e^}  are  the 
hyperedges  of  the  hypergraph.  The  set  E  is  a  family  of 
subsets  of  X  (i.e.  each  e,-  is  a  subset  of  X )  such  that 

1.  e,  #  Z,i  =  1,  ■  ■  ■,"» 

2.  ur=ie.  =  -^- 

A  graph  is  a  hypergraph  whose  fayperedges  have  cardinal¬ 
ity  of  two.  To  each  hyperedge,  we  associate  an  attribute 
set  that  maps  the  vertices  (belonging  to  the  hyperedge)  to 
an  attributed  graph. 

Each  surface  patch  in  the  range  image  forms  an  at¬ 
tributed  vertex.  The  attribute  values  are  the  surface  prop¬ 
erty  values.  For  each  pair  of  surfaces  that  are  connected, 
an  attributed  arc  is  formed.  The  attributes  of  the  arc 
describe  the  interfacing  edge  and  the  relative  geometri¬ 
cal  information  between  the  two  surfaces.  Groups  of  the 
attributed  vertices  (surface  patches)  form  an  hyperedge, 
and  with  each  hyperedge  we  associate  an  attributed  graph 
that  describes  the  topology  of  the  component  attributed 
vertices  (surface  patches). 

The  set  of  vertices  that  form  the  hyperedge  should 
represent  a  topologically  significant  feature  in  the  graph 
so  that  the  matching  task  is  guided  by  the  topology  of  the 
scene.  Cliques  in  the  graph  are  significant  features  that 
are  rich  in  inform;>tion.  Physically,  the  cliques  represent 
groups  of  surfaces  that  are  adjacent  to  each  other.  Since 
a  clique  provides  a  larger  attribute  set  and  many  geomet¬ 
rical  properties,  the  probability  of  a  false  positive  match 
(between  two  cliques)  is  reduced  significantly.  Each  clique 
forms  a  hyperedge  in  the  hypergraph  and  the  attributed 
graph  describing  the  clique  is  the  associated  attribute  of 
the  hyperedge.  Figure  1  illustrates  the  formation  of  a  hy¬ 
pergraph  from  a  scene. 

The  complexity  of  computing  the  ;liques  in  a  graph 
is  exponential,  so  the  formation  the  hypergtaph  will  be 
exponential.  However,  the  physics  of  the  range  imaging 
process  restricts  the  size  of  the  cliques  in  the  scenes  that 
we  observe.  It  can  be  shown  that  the  size  of  the  clique 
is  restricted  to  be  four  |8].  Once  the  upper  bound  on  the 
size  of  the  cliques  is  known,  the  complexity  of  computing 
the  cliques  becomes  0(n). 

THE  MATCHING  PROCEDURE 

This  section  presents  the  matching  procedure  used  to 
derive  t  tic  surface  correspoiidciif  es  in  a  sequence  of  range 
images  Tlic  heart  of  the  procedure  is  a  ditxcttd  tree  search 


Figure  1;  An'object  and  its  corresponding  Hypergraph  rep¬ 
resentation 

algorithm  that  tests  various  hypotheses  and  rejects  the 
impossible  ones.  Finally  the  interpretation  that  gives  the 
largest  match  is  selected  as  the  solution.  Constrained  tr^ 
search  algorithms  have  been  used  in  many  applicutlon- 
|4,  5,  7).  Data  pairings  are  formed  by  a  depth  first  search 
of  an  interpretation  tree.  Each  node  ot  the  tree  repre¬ 
sents  a  possible  pairing.  The  first  data  (surface  patch) 
is  taken  from  the  first  scene  and  paired  with  each  of  the 
data  in  the  second  scene.  These  form  the  nodes  in  the  first 
level  of  the  trM.  To  account  for  missing  surface  segments 
due  to  occlusibns,  the  data  is  also  paired  with  a  wild  card 
«.  Subsequent  levels  of  the  tree  correspond  to  pairings  of 
other  vertices.  Each  branch  of  the  tree  represents  a  partial 
matching  of  the  scenes.  The  constraints  are  used  to  prune 
the  search  tree  and  thus  reduce  the  search  space. 

We  present  a  variation  to  the  constrained  tree  search, 
in  which  the  search  is  directed  based  on  the  current  hy¬ 
pothesis.  The  directed  search,  coupled  with  the  termina 
tion  conditions,  further  reduces  the  search  space.  Thu  key 
idea  is  to  use  the  topological  constraints  of  the  scene  to 
determine  the  next  most  likely  match,  and  to  accept  or 
reject  the  matches  based  on  the  geometrical  constraints. 

The  features  used  in  the  matching  process  are  sur¬ 
face  segments.  We  assume  that  a  segmentation  algorithm 
|9,  lOJ  segments  the  range  image  into  surface  patches  and 
the  surface  parameters  are  computed.  The  interfacing 
edge  between  the  surface  patches  ate  detected  and  their 
properties  are  computed.  The  propecties  of  the  edge  seg¬ 
ments  used  are  (1)  the  edge  type  (,  :  i  ’ght  line  or  curved), 
(2)  the  edge  length,  and  (3)  the  depth  discontinuity.  The 
depth  discontinuity  across  the  edge  implies  that  one  sur¬ 
face  may  be  occluding  (partially  ot  completely)  another 
surface.  The  information  about  occlusion  is  also  incori>o- 
rated  in  the  attribute  list  of  the  surface  patches. 

The  constraints  used  are  similar  to  the  unary  and  bi¬ 
nary  constraints  developed  by  Crimson  and  Lozano-Perez 
[6].  The  only  unary  constraint  we  use  is  the  surface  type 
classification  (planar,  cylindrical,  conical,  etc.).  Other 
properties,  used  in  model  based  object  recognition,  such 
as  area,  perimeter,  compactness,  etc.,  are  very  sensitive 
to  occlusion,  and  since  occlusion  may  occur  in  either  of 
the  range  images,  these  properties  cannot  be  used  as  con¬ 
straints.  The  binary  constraints  describe  the  relative  prop¬ 
erties  between  pairs  of  surface  segments.  The  properties 
we  use  are  (1)  connectivity,  (2)  the  angle  between  the  sur¬ 
face  patches,  (3)  the  range  of  distances  between  the  two 
surface  patches,  (4)  the  range  of  the  components  of  the 
vector  spanning  the  two  surface  patches,  and  (5)  the  pr.  ,>■ 
erlics  of  the  interfacing  edge.  Each  conslraint  is  measured 
and  tested  against  a  predetermined  threshold.  For  surface 


:  ^i\ ‘  .,r;  or r  ^  in 

lorniation  is  not  tu;:i|Jii  !i’  ,.i  !,<’,i>i,bur  rti.i;.  be  hiiidojs)  anii 
the  connectivity  information  may  be  inaccurate.  There¬ 
fore,  for  such  cases  only  a  ueai  match  is  hypothesized 
which  is  subject  to  conformation  or  rejection  baaed  on 
further  evidence. 

Matching  between  the  two  hypergraphs  representing 
the  scenes  is  achieved  by  computing  the  match  between 
the  component  hyperadges  A  match  between  the  two  hy 
peredges  is  hypothesized  The  two  hyperedges  are  matched 
by  matching  the  attributed  graphs  representing  the  hy- 
peredges.  An  order  of  vertices  is  estab'.ished  at  each  stage 
of  the  match.  Fhe  onlei  deiermincs  the  branches  taken 
in  the  search  tree  The  order  is  determined  by  listing 
the  ;,yporedges  mnnerted  to  the  vertices  that  have  been 
matched  in  curnmi  hiypoihcsis.  The  matching  procedure 
starts  by  selecting  the  large^i  hypercilge  //,  and  //J  in  the 
two  scenes.  The  vertev  with  the  largest  degree  is  selected 
as  the  first  node  m  and  it  is  matched  with  the  corre¬ 
sponding  vertex  In  tiie  semnd  hyperedge.  The  unary  and 
the  binary  constraints  are  checked  to  evaluate  the  match 
between  the  hyperedges.  Once  the  hyperedge- match  has 
been  estabiishe-i .  t  he  sernr.d  set  of  hyrieredges  are  selected 
The  next  hspereiige  ,r/_,  !<;  the  hyperedge  connected  to  //) 
at  rjj .  A  match  for  e.,*-  ,h  *-,1 '  Se  h i- peredges  connected  to  //j 
at  H)  is  found,  fhe  search  then  proceeds  to  find  matches 
of  hyperodges  rcinnericl  -n  // j  at  other  vertices  belonging 
to  // 1 .  The  prfice,-t-;rr'  eo.'s  do'.s  n  the  list  of  alt  the  sertice* 
in  'ho  hvpergraph'  ;t  ’lie  irder  eva!  iateti  earlier.  Once  a 
m.,ich  for  a  h-.peredge  ts  Irtu".,!  that  h.pefedgc  is  marked 
a.s  malrhrd.  The  rtiarsed  hyperetjges  are  not  considcrei!  ir, 
the  f'lturo  hypo' iieses. 


F.gure  2:  The  drpth  rrinp.s  o]  a  sequence  of  range  images. 


Figure  3:  The  segrienird  ro  ,gr  images 


figures  2  i  illustrate  the  algorithm  on  an  example 
higiire  2  shows  the  depth  maps  nf  two  frames  m  the  se 
quence  of  range  images  The  scenes  consist  of  a  jumble  of 
different  kind*  of  objects  The  camera  is  moved  to  obtain 
the  second  frame  of  the  sequence  The  segme/itation  ah 
goriihm  of  [iO'  wa.s  applied  on  the  images  and  the  results 
input  to  the  matching  algorithm  The  seg 
are  shown  m  figure  .3  The  first  step  of  the  algorithm 
generates  the  a'lnbut.-d  graph  of  the  scene  and  corTipuies 
the  cliques  ir.  'he  graph  f.ach  clique  forms  a  Ksperedge 
in  the  gcnerateii  hypereraph  The  hspergraphs  gererated 
are  shown  m  'he  figure  t  for  each  h>  perecige  *he  rc-mpo 
nent  vertices  form  an  altrd  oted  graph  In  the  fig-i'e  the 
arcs  of  the  attriiiutcd  graph  .a'e  shown  in  the  hipere.-fges 
Tsing  the  propofPes  of  the  riige,  ouerfacing  two  ~  :tU-e 
segments,  it  is  determined  d  'wo  surfaces  are  cor:ne''e,j  if 
there  exists  ar.  occluding  e.|g,.  between  two  surfaces  then 
the  arc  in  tin-  at’rib’it.’ct  gi.ijih  ;s  ucnii  'shown  in  the  fig 
ure  •!  with  do'ted  lines.  match  based  or,  a  'jenk  arc  is 
a  u'caF  match  and  further  e-. nfence  is  required  to  cor.firm 

the  hypothesis 


Termination  of  the  matching  procedure  occurs  if  the 
fritC  on  of  stirface  patches  matched  exceeds  a  threshold. 
Once  a  match  ha.s  been  determined  (i.c.,  the  search  pro¬ 
cedure  has  reached  the  leaf  node  of  the  tree),  the  num- 
i  1  r  of  poi-ti'.e  pairings  .in,  non-wild  card  pairings)  is 
rompiited  If  I!;'  numb'’'  r-  'ess  than  the  threshold  frac¬ 
tion  then  'tie  ’  rore/liire  tiacKtracks  and  searches  other 
branches  \!  e-.  rrs  ^'.see  *  fi,.  best  possihie  match  is  com¬ 
pared  with  t,he  c-irrr.,;i  i„.st  TTialch,  If  ! hc  best  possible 
match  IS  'malmr  than  the  C'lrrent  match,  then  the  search 
along  that  bra'  r!i  is  ahandoned  and  the  next  branch  is 
investigated. 

RE.SULT5 

In  this  sec';,,:i  ',,.e  present  an  evainpie  of  a  range  im¬ 
age  equenrr  and  ctcscribe  iiow  the  matching  algorithm 
computes  the  s'l r ',->re  cr,rres pondenres  The  algorithm  wa.s 
tried  siicressC.;  ,,,,  types  nf  range  image  se- 

’pieiires. 


Figure  a  The  generated  hgpergraphs  of  the  range  image s 

The  first  hyperedge  pair  hypothesized  to  match  is 
{A.i.y)  in  the  fict  s^pne  matches  {3.4.11)  The  vertex 
with  the  highest  degree  h  is  considered  M  the  fits'  sertex 
The  Unary  constraints  lease  nr,!v  one  option  i  e.,  ,  h,3,  ^s 
the  first  node  m  the  interpretation  tree  Howeser.  tj,p 
two  vertices  t  and  j  do  .not  .'natch  anv  ser'ex  so  thei  are 
matched  wi'li  'he  ss lid  raro  *.  NVite  that  in  'he  finai  rr.atrh 
that  IS  nhiained  (he  pairing  !h,3)  is  an  incorrect  pairing. 
The  algorithm  backtracks  and  finds  (he  correct  match  even 
though  we  s'.art  with  an  I'  c.srrect  match  W'e  present  the 
first  few-  steps  to  liliistra'e  -low  the  algorithm  works.  The 
second  hyperedgr  considered  for  ."natch  i.s  (h.g)  because  it 
IS  connected  In  the  firs'  hsperedge  at  h.  Since  the  current 
hypothesis  IS  ih,.3).  th'-  next  hsperedge  match  considered 
is  between  (fi.5}  and  {3.2}  The  unary  constraints  arc 
-satisfied  between  the  pair  \g.2)  so  th'  binary  constraints 
of  angle.  dis!,ince  and  the  'panning  vector  are  tested  .Ml 
the  constraints  are  satisfied  so  the  match  pair  is  accepted 


in  the  currcm  hypothesis.  The  next  hyperedge  eoiisiilored 
now  is  {j,  k}  as  it  is  connected  to  the  first  hyperedge.  The 
match  between  and  {11, 12)  is  tried  and  the  pairing 
(^,11)  satisfies  ail  the  constraints,  but  the  connectivity  is 
not  satisfied  (ifc  is  not  connected  to  h  while  11  is  connected 
to  3).  At  this  point  we  use  the  fact  that  the  arc  between 
3  and  1 1  is  a  weak  one  so  it  can  be  broken  and  all  the 
constraints  are  satisfied. 

The  procedure  continues  till  a  complete  match  (i.e., 
all  the  vertices  are  ..ccounted  for)  is  obtained.  The  match 
size  is  evaluated  and  if  a  better  match  can  be  obtained, 
the  procedure  backtracks  to  improve  the  results.  The  final 
matching  results  are: 


It  may  bo  observed  that  in  the  example  shown  there 
are  many  errors  in  segmentation  {for  eg.  surfaces  j,  7. 
U.  13.  etc.)  and  there  are  surfaces  that  get  occluded 
in  one  of  the  scenes  (for  eg.  I  and  1);  notwithstanding, 
the  algorithm  performs  well  and  the  corr,.;pondences  are 
evaluated. 

CONCLUSION 

Computing  motion  and  tracking  an  object  over  a  se¬ 
quence  of  range  images  involves  establishing  correspon¬ 
dence  between  the  features  of  the  object  in  different  im¬ 
ages  in  the  sequence.  The  question  of  finding  correspon¬ 
dence  in  a  sequence  of  range  images  is  very  different  from 
finding  correspondence  between  a  model  and  an  object  de¬ 
scription.  The  fundamental  difference  lies  in  the  fact  that 
the  model  description  of  the  object  is  complete,  while  in 
case  of  a  sequence  of  range  images,  both  descriptions  of 
the  scene  are  incomplete.  The  lack  of  information  forces 
one  to  impose  only  weak  constraints  and  allow  for  larger 
tolerances. 

We  presented  a  new  framework  and  procedure  to  com¬ 
pute  the  correspondences  between  surface  segments  in  a 
sequence  of  range  images.  Fundamental  to  our  frame¬ 
work  is  the  hypergraph  representation  of  the  range  im¬ 
ages.  The  hierarchical  representation  of  hypergraphs  not 
only  reduces  the  search  space  significantly,  but  also  facil¬ 
itates  the  encoding  of  the  topological  and  geometrical  in¬ 
formation.  In  addition  to  the  topological  and  geometrical 
information  obtained  from  the  scene  we  also  use  a  priori 
knowledge  of  the  scene  obtained  from  the  physics  of  the 
iaser  scanning  process  used  to  produce  the  range  images. 
Each  piece  of  information  used  reduces  the  complexity  of 
the  matching  procedure  by  pruning  the  search  space.  The 
solution  is  robust  and  accounts  for  errors  in  segmentation, 
occlusions  of  surfaces,  and  noise  in  the  data.  By  using 
the  topological  information  to  guide  the  search  procedure, 
the  average  case  complexity  of  the  algorithm  is  reduced 
significantly. 
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ABSTRACT 

In  this  paper  we  present  a  novel  technique  for  establishing 
a  robust  and  accurate  correspondence  between  a  3d  model 
and  a  2d  image.  We  present  a  transform  clustering  ap¬ 
proach  to  isolate  the  transformation  that  maps  the  model 
features  to  the  image  features.  It  is  shown  that  this  trans¬ 
form  clustering  technique  alleviates  the  problems  with  us¬ 
ing  (he  traditional  Hough  transform  techniques  used  by 
l>revious  researchers.  We  demonstrate  the  effectiveness  of 
our  approach  in  estimating  the  position  and  and  pose  of  an 
autonomous  mobile  robot  navigating  in  an  outdoor  urban 
environment.  We  present  experimental  results  of  testing 
this  approach  using  a  model  of  an  airport  scene. 

INTRODUCTION 

The  task  of  establishing  a  reliable  and  accurate  correspon¬ 
dence  between  an  image  of  a  scene  and  a  stored  model  of 
it  occurs  in  a  large  number  of  computer  vision  problems. 
Autonomous  navigation  of  a  mobile  robot  given  a  priori 
model  of  the  environment  and  model-based  object  recog¬ 
nition  are  two  examples  of  computer  vision  tasks  in  which 
•he  r:odel-image  correspondence  needs  to  be  addressed.  In 
the  context  of  autonomous  navigation,  the  robot  is  pro¬ 
vided  with  a  preloaded  world  model  of  the  environment. 
The  world  model  could  be  in  different  forms,  such  as  a 
Digital  Elevation  Map  (DEM),  a  CAD  description,  or  a 
floor  map.  The  robot  uses  an  onboard  camera  to  image 
the  environment.  Once  we  establish  a  correspondence  be¬ 
tween  the  image  and  the  model,  the  robot’s  position  and 
pose  can  be  determ’ncd.  This  position  information  can  be 
used  by  the  robot  to  successfully  navigate  in  its  environ¬ 
ment.  In  the  context  of  model-based  object  recognition, 
we  are  given  a  geometric  description  of  the  object  to  be 
recognized  and  an  image  of  the  scene  in  which  the  object 
is  present.  The  task  is  to  isolate  the  object  in  the  scene 
by  u'ing  the  image.  Model-image  correspondence  are  par- 
licuiarly  difficult  because  the  image  and  the  model  are 
usually  in  different  formats,  different  co-ordinate  frames 
and  of  different  dimensions. 

A  popular  approach  tc  solving  this  problem  is  to  ex¬ 
tract  features  from  the  image  and  search  the  model  de¬ 
scription  for  the  corresponding  set  of  features.  The  type 
of  features  required  and  the  number  of  features  used  de¬ 
pends  on  the  model  description  and  what  is  assumed  to 
be  known  about  the  scene.  For  example,  in  navigating  the 
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Scientific  Reaeaich  (AFSC)  contract  F496JO-89-C-OOSt. 


robot  in  an  indoor  structured  environment  with  a  given 
CAD  model  of  the  environment,  it  is  common  practice  to 
use  line  segments  as  features  [3].  On  the  other  hand,  in 
navigating  the  robot  in  an  outdoor  mountainous  terrain 
given  a  DEM  of  the  environment,  using  curves  may  be  a 
logical  choice  [9], 

Typically,  in  these  problems  the  model  and  the  cam¬ 
era  (robot)  are  specified  in  two  different  co-ordinate  sys¬ 
tems.  Once  we  extract  the  relevant  features  from  the  im¬ 
age  and  identify  the  corresponding  features  in  the  model, 
we  can  computer  the  transformation  T  that  maps  the 
model  features  into  the  image  features.  The  parameters  of 
this  transformation  are  the  required  position  and  pose  of 
the  camera  (robot)  with  respect  to  the  model.  Solving  for 
the  parameters  of  T  ,  once  a  set  of  model-image  feature 
correspondences  is  established,  is  a  very  well  studied  prob¬ 
lem  [2].  Therefore,  the  crucial  task  to  be  accomplished 
is  that  of  establishing  a  reliable  and  accurate  correspon¬ 
dence.  Noise,  occlusions,  errors  in  feature  detection  and 
inaccurate  model  descriptions  further  complicate  this  cor¬ 
respondence  problem. 

TVansform  Clustering;  Previous  researchers  have 
considered  the  technique  of  matching  a  key  model  feature, 
such  as  a  long  edge  or  a  set  of  lines  in  specific  orientations, 
to  establish  an  initial  transformation  [1,  6].  Subsequent 
assignments  are  then  used  to  refine  this  transformation. 
New  assignments  are  selected  on  predictions  of  a  model 
feature,  projected  into  the  image  using  the  current  trans¬ 
formation.  However,  these  techniques  assume  that  it  is 
possible  to  initially  select  a  correct  key  model  feature, 
which  may  not  always  be  possible. 

Some  researchers  used  the  generalized  Hough  trans¬ 
form  and  its  related  parameter  hashing  techniques  to  per¬ 
form  transform  clustering  to  isolate  the  transformation 
mapping  the  model  features  into  the  image  features  [6, 
5,  10].  The  generalized  Hough  transform  works  by  first 
quantizing  the  n-dimensional  parameter  space  into  dis¬ 
crete  buckets  or  bins.  The  parameters  are  the  compo¬ 
nents  describing  T.  From  the  given  image,  features  are 
extracted  using  a  feature  extractor.  Then  all  the  pos¬ 
sible  model-image  feature  correspondences  are  hypothe¬ 
sized  and,  for  each  hypothesis,  the  parameter  vector  is 
computed.  For  each  parameter  vector  so  computed,  its 
n  components  are  quantized  and  used  as  indices  to  vote 
in  one  of  the  n-dimensional  buckets.  Searching  for  large 
clusters  is  then  accomplished  by  finding  the  buckets  with  a 
large  numbers  of  entries.  Sometimes  it  is  possible  that  one 
correspondence  may  not  give  explicitly  all  the  components 
of  the  parameter  vector,  but  may  only  give  a  range  of  pos¬ 
sible  values  for  each  component.  In  this  case,  entries  are 


579 


made  into  ail  the  buckets  within  range.  The  advantage  of 
this  approach  is  that  clustering  provides  a  robust  ccilecton 
for  selecting  valid  model  feature  assignments.  The  effects 
of  missing  or  incorrect  features  due  to  occlusion,  shadows, 
or  low  contrast,  are  not  felt. 

The  problems  associated  with  using  the  Hough  trans¬ 
form  approach  to  transform  clustering  arc  that  large  trans¬ 
form  clusters  may  occur  randomly.  If  these  clusters  are  as 
large  or  larger  than  those  due  to  the  correct  transform, 
the  estimation  procedure  that  relies  only  on  the  Hough 
transform  will  be  erroneous.  If  the  number  of  buckets  is 
increased,  then  the  possibility  of  random  large  clusters  is 
alleviated  but  the  number  of  computations  grows  rapidly. 
Crimson  [-1)  summarizes  these  problems  with  the  gener¬ 
alized  Hough  transform. 

This  paper  presents  a  method  to  reduce  the  prob¬ 
lems  associated  with  the  Hough  transform  approach  to 
transform  clustering  by  using  a  partition  of  the  parame¬ 
ter  space,  which  is  not  necessarily  uniform.  The  partition 
is,  in  fact,  intelligent  and  uses  a  priori  model  informa¬ 
tion,  Due  to  the  geometric  constraints  imposed  by  the 
model  and  the  camera  geometry,  not  all  model  features 
may  be  visible  in  all  camera  positions.  Typically  occlu¬ 
sions  between  the  model  features  affect  their  visibility  at 
various  positions.  However,  since  we  know  the  3d  descrip¬ 
tions  of  the  model  features,  these  geometric  constraints 
can  be  pre-computed  and  used  to  partition  the  parame¬ 
ter  space  to  reduce  the  probability  of  the  occurrence  of 
random  transform  clusters. 

We  demonstrate  the  effectiveness  of  our  approach  in 
estimating  the  position  and  and  pose  of  an  autonomous 
mobile  robot.  The  robot  is  assumed  to  be  navigating  in 
an  outdoor,  urban  environment.  The  3d  description  of  the 
bties  that  constitute  the  rooftops  of  the  buildings  is  given 
as  a  world  model.  The  position  and  pose  of  the  robot  ate 
estimated  by  establishing  a  correspondence  between  the 
lines  extracted  from  the  image  (image  features)  and  the 
lines  that  constitute  rooftops  of  the  buildings  (model  fea¬ 
tures).  Hy  exploiting  the  visibility  constraints  imposed  by 
the  3d  world  model  and  the  camera  geometry,  we  partition 
the  parameter  space  into  into  distinct,  non-overlapping  re¬ 
gions  called  Edge  Visibilitg  Regions  {E\'Rs}  [71.  In  each  of 
these  regions,  we  also  store  the  list  of  model  features  that 
are  visible  from  within  that  region.  We  then  hypothesize  a 
corrcspoi.Jcnce  between  all  pairs  of  model  and  image  fea¬ 
tures  and  compute  the  range  of  possible  transformations 
for  each  hypothesis.  We  vote  in  all  the  regions  in  the  pa¬ 
rameter  space  where  this  transformation  is  valid.  After 
considering  ail  the  pairings,  we  select  the  regions  in  the 
parameter  space  with  the  large  numbers  of  votes  as  the 
candidate  EVRs  for  position  estimation.  The  actual  cor¬ 
respondence  and  position  estimation  are  then  performed 
by  a  constrained  search  process  within  these  EVRs  using 
a  interpretation  tree  search  paradigm. 

PARTITIONING  THE  PARAMETER 
SPACE 

Consider  the  world  coordinate  system  OXYZ  and 
the  robot  coordinate  system  CXyZ'  shown  in  Figure  1. 
Generally,  the  transformation  T  that  transforms  O.VV' if 
into  O'X'Y'Z'  has  six  degrees  of  freedom:  tlircc  rotational 
and  three  translational.  Sometimes,  depending  on  the  ap¬ 
plication.  some  of  these  degrees  of  freedom  can  be  elimi¬ 


nated.  Most  mobile  robot  seif-location  tasks  make  the  as¬ 
sumption  that  the  robot  is  on  the  ground  {OXY  plane), 
so  the  Z-transIation  (the  height  of  the  robot  above  the 
ground)  is  assumed  to  be  known  or  to  be  zero.  The  cam¬ 
era  on  the  robot  is  assumed  to  have  zero  roll  (rotation 
about  A-axis),  and  the  tilt  angle  of  the  camera,  (rotation 
about  the  Y -axis)  is  assumed  to  be  measurable.  So,  there 
are  effectively  three  parameters  in  the  transformation;  two 
translational  {X,Y)  and  one  rotational  S  (the  pan  angle 
of  the  camera,  which  is  a  rotation  about  the  Z-axis).  Like¬ 
wise,  in  this  paper  we  have  only  three  parameters  of  T; 
Xy  Y  and  9.  The  parameter  space  of  the  transformation 
is  thus  the  entire  OXY  plane  and  the  range  of  robot  ori¬ 
entation  ^  is  0  through  360  degrees. 

In  this  section,  we  briefly  we  describe  a  method  for 
partitioning  the  OXY  plane  into  regions  called  Edge  Vis¬ 
ibility  Regions  (EVRs)  using  the  given  world  model  de¬ 
scription.  For  more  details  sec  ['].  Assuv  .ated  with  each 
EVR  is  a  list  of  the  world  model  features  visible  in  that  re 
gion,  called  the  visibility  list  (VL).  .Vo  two  adjacent  EVR» 
have  the  same  VL.  Also  stored  for  each  entry  in  the  VL 
of  an  EVR  is  the  range  of  robot  orientations  from  which 
the  feature  is  visible.  Thus,  cacli  EVR  is  a  region  of  space 
which  has  the  topological  properly  that  from  its  points, 
the  same  set  of  edges  of  the  model  arc  visible  through 
a  complete  circular  scan.  The  EVR  representation  par¬ 
titions  the  entire  parameter  space  of  (X,Y,^)  and  cap¬ 
tures  the  visibility  constraints  between  the  world  model 
features. 

The  algorithm  that  divides  the  OXY  plane  into  the 
desired  EVRs,  along  with  their  associated  VLs,  uses  three 
subprocesses  called  Split,  Project,  and  Merge.  The  algo¬ 
rithm’s  basic  idea  is  to  start  with  the  entire  OXY  plane  as 
one  EVR  with  a  NULL  visibility  list.  Each  of  the  polygon- 
that  makes  up  the  building’s  rooftop  in  the  world  model  is 
considered  in  turn  by  extending  its  edges,  and  the  EVRs 
that  are  intersected  are  divided  into  two  new  ones.  The 
new  EVRs  then  replace  the  old  one,  and  the  VLs  of  the 
new  EVRs  are  updated  to  account  for  tlie  visibility  of  this 
edge  by  considering  it  to  be  visible  in  one  half- plane,  say 
the  half-plans  into  the  left  of  the  edge,  and  invisible  in 
the  other.  The  Split  process  handles  this  updating.  For 
each  new  rooftop  considered,  the  mutual  occlusion  of  the 
rooftop’s  edges  with  the  other  existing  rooftops  is  handled 
by  forming  the  shadow  region  of  these  edges  on  the  otl.or 
existing  rooftops.  The  Project  process  iiandles  the  forming 
of  these  shadow  regions.  Fiiuilly,  tiie  .Merge  process  con¬ 
catenates  all  the  adjacent  EVRs  with  identical  VLs  into 
one  EVR.  .\ftcr  partitioning  tlie  O.Y>'  plane  into  EVRs, 


the  1  ange  of  the  robot's  orientations  for  which  each  model 
feature  in  the  VL  of  an  EVR  is  visible,  is  also  computed 
and  stored.  An  efficient  method  to  compute  these  ranges 
is  also  developed.  Figure  2(a)  shows  the  world  model  and 
Figure  5{b)  shows  the  EVR  description  computed  from 
this  world  model. 

FEATURE  EXTRACTION 

In  this  research,  we  iisctl  a  scale  model  of  the  Austin  Ex¬ 
ecutive  Park  .Xirporl  to  lest  the  position  estimation  algo¬ 
rithms  developed.  The  world  model  thus  consists  of  the 
3d  descriptions  of  the  rooftops  of  the  three  buildings  in 
this  airport.  Figure  2(a}  shows  this  world  model.  A  cali¬ 
brated  camera  is  placed  in  this  environment  and  used  to 
acquire  the  images  of  the  model.  These  are  then  used  as 
the  robot's  views.  Figure  2(b)  shows  one  such  view.  We 
use  a  Canny  edge  detector  to  extract  the  edges  from  this 
image.  Contiguous  edges  are  then  linked  using  a  pixel 
chaining  algorithm.  We  then  use  a  line  fitting  technique 
to  form  line  segments  from  these  pixel  chains.  These  line 
segments  are  then  thresholded  by  length  to  remove  all  the 
tines  shorter  than  20  pixels.  Figure  3(a)  shows  these  lines. 
We  use  a  rooftop  extraction  technique  to  select  the  lines 
that  correspond  to  the  rooftops  only.  The  technique  scans 
each  column  of  the  Line  segment  image  from  top  to  bottom 
and  selects  the  topmost  lines  only  in  each  column.  All  the 
liucs  that  lie  below,  completely  within  the  projection  of  a 
selected  line,  are  then  discarded.  The  lines  isolated  using 
this  technique  are  then  considered  as  the  image  features. 
Figure  3(b)  shows  these  lines.  Notice  that  the  image  fea¬ 
ture  extraction  procedure  is  far  from  perfect.  Some  of  the 
lines  that  correspond  lo  rooftops  are  not  extracted  and, 
due  to  noise  and  occlusion,  some  of  the  extracted  lines 
do  not  arise  from  the  rooftops  but  from  extraneous  ob¬ 
jects  such  as  trees  and  telephone  poles.  The  task  is  thus 
to  use  the  transform  clustering  and  the  search  technique 
to  correctly  isolate  the  model  features  and  the  noise  fea¬ 
tures  from  these  image  features  and  accurately  estimate 
the  robot's  position  and  pose  in  the  environment. 

MODIFIED  HOUGH  TRANSFORM 

Having  formed  the  EVR  description  of  the  environ¬ 
ment  and  extracted  the  features  from  the  images,  we  use  a 
mod. .led  Hough  transform  to  isolate  a  small  set  of  EVRs 
likely  to  contain  the  robot's  location.  The  EVRs  are  used 
as  a  partitioning  of  the  parameter  space  (X,Y,6)  of  the 
transformation.  We  find  that  this  partitioning  alleviates 
the  problems  of  traditional  Hough  transform,  namely,  the 
random  occurrence  of  large  clusters  and  the  resulting  need 
for  the  large  .Knouiit.s  of  memory  required  to  perform  the 


fine  partitioning  of  the  parameter  space  to  eliminate  this 
problem.  Since  it  is  difficult  to  accurately  extract  the  end 
points  of  the  rooftops,  w'e  use  infinite  lines  and  not  line 
segments  as  the  image  features.  The  image  features  are 
2d  lines  and  the  model  features  are  3d  lines.  Using  one  2d 
to  3d  line  correspondence,  we  can  compute  the  orientation 
of  the  robot  6  and  get  a  constraint  on  the  position  of  the 
robot  of  the  form  aX  -f  bY  -f  c  =  0,  where  a,b,  and  c 
are  construnts.  This  constraint  describes  a  line  L  in  the 
OXy  plane.  See  (8)  for  details  of  the  derivation. 

VVe  hypothesize  all  the  possible  model-image  feature 
correspondences,  and  for  each  hypothesis  compute  the  ff 
and  get  the  constraint  line  L  on  (X.V).  We  now  vote  in 
all  the  EVRs  where:  1)  the  line  L  intersects  the  EVR;  and 
2)  the  f  lies  within  the  range  of  possible  robot  orientations 
in  the  visibility  list  of  the  EVR.  We  finally  select  the  EVR 
with  a  largest  numbers  of  votes  as  the  candidate  EV‘'Rs 
most  likely  to  contain  the  robot’s  location.  Figure  4  shows 
a  plot  of  the  EVR  number  vs.  the  number  of  votes.  Figure 
5(a)  shows  the  complete  EVR  description  and  Figure  5(b) 
shows  the  selected  candidate  with  a  large  number  of  votes. 

INTERPRETATION  TREE  SEARCH 

Having  isolated  the  candidate  set  of  EVRs  most  likely  to 


Figure  5:  (a)  EVR  description  (b)  EVRs  isolated  by  the 
Hough  transform 
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Figure  6:  (a)  Final  EVR  (b)  Estimated  robot  location 
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Table  1:  Tlic  Search  results 

contain  the  robot’s  location  using  the  modified  Hough 
transform,  we  now  wish  to  isolate  the  robot’s  location 
more  precisely  among  these  EVKs.  For  each  of  the  candi¬ 
date  E\'Rs  we  form  an  inicrpretation  tree  of  all  the  possi¬ 
ble  model-image  feature  correspondences  and  then  search 
this  frees  to  isolate  the  correct  set  of  correspondences. 
.Vote  that  these  tree  are  very  short  since  we  only  need 
to  consider  those  model  features  that  are  present  in  each 
E\  R's  \'L.  Also  by  using  the  geometric  constraints  estab¬ 
lished  by  the  EVR,  that  is,  its  extent  in  the  OXY  plane 
and  tr.e  range  of  possible  9  values,  we  can  prune  large 
parts  of  this  interpretation  tree. 

This  search  process  finally  isolates  the  correct  EVR 
containing  the  robot’s  location  and  a  set  of  model-image 
feature  correspondences.  Using  all  of  these  correspon¬ 
dences  in  a  least  squares  framework,  the  robot’s  position 
and  pose  are  accurately  estimated.  Figure  6{a)  shows  the 
EVR  isolated  as  containing  the  robot’s  location  amd  Figure 
6(b)  shows  the  final  estimated  robot’s  position.  We  find 
that  the  estimated  position  and  pose  obtained  by  these 
techniques  are  quite  close  to  their  true  values.  Table  1 
compares  the  estimated  and  the  actual  values  obtained 
from  the  test  runs  using  the  world  model  shown  in  Figure 
2(a). 

CONCLUSIONS 

This  paper  presented  a  novel  and  efficient  transform  clus¬ 
tering  technique  for  establishing  a  robust  and  accurate 
correspondence  between  a  3d  model  and  a  2d  image.  We 
demo.'.strate  the  effectiveness  of  this  technique  in  estimat¬ 
ing  i.be  position  and  pose  of  an  autonomous  mobile  robot 
in  a.n  outdoor  urban  environment  consisting  of  polyhe¬ 
dral  buildings.  It  is  shown  that  this  transform  cluster¬ 
ing  technique  alleviates  the  problems  associated  with  the 
•  r.vi  1  Hough  tr.uisform  techniques  u.sed  hy  previous 

■iiougli  we  have  ileiiioii.st rated  the  utility  of  the 

..I-  for  the  moblie  roi<o;  -.eif-lotatioii  jirublem.  the 


approach  can  be  easily  extended  to  other  computer  vision 
tasks  such  as  model-based  object  recognition.  One  pos¬ 
sible  approach  is  to  precompute  the  characteristic  views 
or  aspects  of  the  object  to  be  recognized  and  use  these  to 
partition  the  parameter  space.  By  imposing  suitable  and 
practical  restrictions  on  the  number  of  degrees  of  freedom 
in  the  transformation  between  the  model  and  the  imag- 
[6j,  the  number  of  aspects  can  be  kept  tractable,  by  se¬ 
lecting  an  appropriate  set  of  features  from  the  image  and 
using  a  similar  transform  clustering  approach  as  described 
in  this  paper,  it  is  possible  to  isolate  a  small  set  of  aspects 
of  the  object  corresponding  to  the  given  image.  Using 
a  tree  search  technique  it  is  then  possible  to  establish  a 
more  accurate  correspondence  between  the  image  features 
and  the  model  features  and  isolate  the  correct  aspect,  and 
thereby  recognize  the  object  from  the  given  set  of  models. 
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Abstract. 

This  paper  describes  a  new  algorithm  to  simultaneously  detect  and  clas¬ 
sify  straight  lines  according  to  their  orientation  in  3-D.  The  fundamental 
assumption  is  that  the  most  “interesting”  lines  in  a  3-D  scene  have  orien¬ 
tations  which  fall  into  a  few  precisely  defined  categories.  The  algorithm  we 
propose  uses  this  assumption  to  extract  the  projection  of  straight  edges  from 
the  image  and  to  determine  the  most  likely  corresponding  orientation  in  the 
3-D  scene.  The  extracted  2-D  line  segments  are  therefore  “perceptually” 
grouped  according  to  their  orientation  in  3-D.  Instead  of  extracting  all  the 
line  segments  from  the  image  before  grouping  them  by  orientation,  we  use 
the  orientation  data  at  the  lowest  image  processing  level,  and  detect  seg¬ 
ments  separately  for  each  predefined  3-D  orientation.  A  strong  emphasis  is 
placed  on  real-world  applications  and  very  fast  processing  with  conventional 
hardware. 


1  Introduction 

This  paper  presents  a  new  algorithm  for  the  detection  and  organization  of  line  segments 
in  images  of  complex  scenes.  The  algorithm  extracts  line  segments  of  particular  3-D 
orientations  from  intensity  images.  The  knowledge  of  the  orientation  of  edges  in  the 
3-D  scene  allows  the  detection  of  important  relations  between  the  segments,  such  as 
parallelism  or  perpendicularity. 

The  role  of  perceptual  organization  [5]  is  to  highlight  non-accidental  relations  between 
features.  In  this  paper,  we  extend  the  results  of  perceptual  organization  for  2-D  scenes 
to  the  interpretation  of  images  of  3-D  scenes  with  any  perspective  distortion.  For  this, 
we  assume  a  priori  knowledge  of  prominent  orientations  in  the  3-D  scene.  Unlike  other 
approaches  to  space  inference  using  vanishing  points  [1],  we  use  the  information  about 
3-D  orientations  at  the  lowest  image-processing  level  for  maximum  efficiency. 

The  problem  of  line  detection  without  first  computing  a  free-form  edge  map  was 
addressed  by  Burns  et  al.  [2].  His  algorithm  first  computes  the  intensity  gradient  ori¬ 
entation  for  all  pixels  in  the  image.  Next,  the  neighboring  pixels  with  similar  gradient 
orientation  are  grouped  into  “line-support  regions”  by  a  process  involving  coarse  ori¬ 
entation  “buckets.”  Finally,  a  line  segment  is  fit  to  the  large  line-support  regions  by  a 
least-squares  procedure.  An  optimized  version  of  this  algorithm  was  presented  in  [3). 

The  algorithm  described  in  this  paper  is  designed  not  only  to  extract  2-D  line  segments 
from  an  intensity  image,  but  also  to  indicate  what  are  the  most  probable  orientations 
for  the  corresponding  3-D  segments  in  the  scene.  Section  2  explains  the  geometry  of 

*  This  research  was  supported  in  part  by  the  DoD  Joint  Services  Electronics  Program  through 
the  Air  Force  Office  of  Scientific  Research  (AFSC)  Contract  F49620-89-C-0044,  and  in  part 
by  the  Army  Research  Office  under  contrrurt  DAAL03-91-G-0050. 


projecting  segments  of  known  3-D  orientation.  Section  3  describes  a  very  fast  algorithm 
to  extract  the  line  segments  from  a  single  image  and  to  simultaneously  estimate  their 
3-D  orientation.  Finally,  Sect.  4  provides  experimental  results  obtained  with  images  of 
indoor  scenes  acquired  by  a  mobile  robot. 


2  Motivation  and  Assumptions 

Wc  chose  to  concentrate  on  objects  which  have  parallel  lines  with  known  3-D  orientations 
in  a  world  coordinate  system.  For  example,  in  indoor  scenes,  rooms  and  hallways  usually 
have  a  rectangular  structure,  and  there  are  three  prominent  orientations  for  3-D  line 
segments;  one  vertical  and  two  horizontal  orientations  perpendicular  to  each  other.  In 
this  paper,  any  3-D  orientation  is  permitted,  as  long  as  it  is  given  to  the  algorithm. 
Therefore,  more  complex  environments,  such  as  polygonal  buildings  with  angles  other 
than  90  degrees,  are  handled  as  well  if  these  angles  are  known.  It  is  important  to  note  that 
human  vision  also  relies  on  prominent  3-D  orientations.  Humans  feel  strongly  disoriented 
when  placed  in  a  tilted  environment. 

Vertical  lines  constitute  an  interesting  special  case  for  two  reasons:  they  are  especially 
common  in  man-made  scenes,  and  their  3-D  orientation  can  easily  be  known  in  the  3-D 
camera  coordinate  system  by  measuring  the  direction  of  gravity.  If  a  2-axis  inclinometer 
is  mounted  on  the  camera  and  properly  calibrated,  a  3-D  vertical  vector  can  be  expressed 
in  the  3-D  coordinate  system  aligned  with  the  2-D  image  coordinate  system.  Inexpensive 
commercial  inclinometers  have  a  precision  better  than  0.01  degree.  Humans  also  sense 
the  direction  of  gravity  by  organs  in  their  inner  ears.  In  our  experiments,  we  estimate  the 
third  angular  degree  of  freedom  of  the  camera  relative  to  the  scene  from  the  odometer 
readings  of  our  mobile  robot.  Provided  that  the  odometer  is  constantly  corrected  by 
vision  [4],  the  odometer  does  not  drift  without  bounds. 

We  can  infer  the  likely  3-D  orientation  of  the  line  segments  from  their  2-D  projections 
in  the  image  plane.  With  a  pinhole  perspective  projection  model,  lines  parallel  to  each 
other  in  the  3-D  scene  will  converge  to  a  vanishing  point  in  the  2-D  projection.  In  partic¬ 
ular,  if  the  orientation  of  the  camera  relative  to  the  scene  is  known,  a  vanishing  point  can 
be  computed  for  each  given  3-D  orientation  before  the  image  is  processed.  All  the  lines 
that  have  a  given  orientation  in  3-D  must  pass  through  the  associated  vanishing  point 
when  projected.  Conversely,  if  a  line  does  not  pass  through  a  vanishing  point,  it  cannot 
have  the  3-D  orientation  associated  with  that  vanishing  point.  In  practice,  if  a  line  does 
pass  through  a  vanishing  point  when  projected,  it  is  likely  to  have  the  associated  3-D 
orientation. 

To  summarize,  the  line  detection  algorithm  of  Sect.  3  knows  in  each  point  of  the 
image  plane  the  orientation  that  a  projected  line  segment  would  have  if  it  had  one  of 
the  predefined  3-D  orientations.  Therefore,  the  basic  idea  is  to  detect  the  2-D  segments 
with  one  of  the  possible  orientations,  and  mark  them  with  the  associated  3-D  orientation 
hypothesis. 

3  Detecting  Segments  and  Estimating  their  3-D  Orientation 

3.1  Coordinate  Systems  and  Transformations 

The  coordinate  systems  are  W  (the  World  coordinate  system,  with  a  vertical  z-axis), 

R  (the  Robot  coordinate  system,  in  which  we  obtain  the  inclinometer  and  odometer 
readings),  C  (the  Camera  coordinate  system),  and  P  (the  coordinate  system  used  for 


the  perspective  projection  on  the  retina).  The  homogeneous  coordinate  transformation 
matrix  from  \\  to  It  is  Twr  —  TroiiTpKch^cAding^rftndftiioni-  ^oii  ind  Tpitch  known 
with  a  good  precision  through  the  inclinometer.  Th.Kiinf  is  estimated  by  the  odometer 
and  Tifinsiations  ‘s  not  used  here.  Trc,  the  coordinate  transformation  matrix  from  R  to  C, 
needs  to  be  completely  determined  through  eye/wheel  calibration.  Finally,  Tcp  is  known 
through  camera  calibration. 


3.2  Overview  of  the  Algorithm 

The  processing  can  be  outlined  as  follows: 

1.  Line  support  region  extraction:  compute  the  angle  between  the  intensity  gradient 
at  each  pixel  and  the  expected  direction  of  the  projection  of  each  3-D  orientation 
(see  Sect.  3.3  for  details).  Use  a  loose  threshold  to  allow  for  noise  in  the  gradient 
orientation.  Reject  improper  pixels  and  3-D  orientations. 

2.  Non-maxima  suppression:  keep  only  the  local  gradient  maxima  along  the  estimated 
perpendicular  to  the  line. 

3.  Pixel  linking:  create  chains  of  pixels  using  a  partial  neighborhood  search  in  the  di¬ 
rection  of  the  estimated  vanishing  points.  This  creates  noisy  linear  chains. 

4.  Line  fitting:  perform  a  least-squares  fit  of  line  segments  to  the  pixel  chains.  Re¬ 
cursively  break  the  pixel  chains  which  cannot  be  closely  approximated  with  a  line 
segment  into  smaller  chains. 

5.  Global  orientation  check:  compute  the  match  between  each  line  and  each  3-D  orien¬ 
tation,  like  in  the  line  support  extraction  step  but  with  a  much  tighter  threshold. 

If  the  0  pnon  heading  is  very  uncertain,  the  lines  will  be  extracted  with  loose  thresholds, 
the  true  heading  will  be  estimated,  and  the  algorithm  can  then  be  run  again  with  tight 
thresholds  for  the  correct  categorization. 

3.3  Extracting  Line  Support  Regions 

For  each  pi.xel  in  the  input  intensity  image  and  for  each  category  of  possible  3-D  orienta¬ 
tions,  we  compute  the  angle  between  the  intensity  gradient  and  the  expected  direction  of 
the  line  in  2-D.  The  expected  line  is  given  by  the  current  pixel  and  the  vanishing  point 
associated  with  the  3-D  orientation.  It  is  not  necessary  to  compute  the  location  of  the 
vanishing  point  (which  may  lie  at  infinity). 

The  homogeneous  transformation  matrix  changing  world  coordinates  into  projective 
coordinates  is  Twp  =  Tcp^rc^wr-  Let  [px»  Py>  Pii  O]^  be  a  non-null  vector  in  the  3-D 
direction  under  consideration.  If  [su,  sv,  s,  l]J  =  Twp  \  z,  l]^  defines  the  relation 
between  a  2-D  point  [«•  u]  and  its  antecedent  by  the  perspective  projection,  then 

[s'u',  s'v',  s’,  l]p  =:  Twp  ([r,  y,  z,  l]J^  -f  [px.Py.P*.  O]^) 

defines  another  point  of  the  estimated  2-D  line.  A  2-D  vector  d  in  the  image  plane  pointing 
to  the  vanishing  point  from  the  current  point  is  then  collinear  to  [u' -  u,  v' —  v]  . 
Algebraic  manipulations  lead  to  [du.  dv]*^  =  [a*  —  a*u,  Oy  —  where 

T  T 

[flx»  flyi  O]  “  Twp  [pxi  Py  I  p2»  O]  yy  • 

.Note  that  Uxi  ay,  and  a*  need  to  be  computed  only  once  for  each  3-D  orientation. 


The  current  pixel  is  retained  for  the  3*D  direction  under  consideration  if  the  angle 
between  d  and  the  local  gradient  g  is  90  degrees  plus  or  minus  an  angular  threshold  7. 
This  can  be  expressed  by 

Ml  ■  llgll  ’ 

or  equivalently: 

(dx  9y  -  dy  <7x)’  >  (<^x  +  dl)  (Jx  +  jJ)  ^ 

with  r  ~  (cos  7)^  computed  once  for  all.  Using  this  formulation,  the  entire  line  sup¬ 
port  extraction  is  reduced  to  8  additions  and  11  multiplications  per  pixel  and  per  3-D 
orientation.  If  an  even  greater  speedup  is  desired,  {gl  +  gj)  may  be  computed  first  and 
thresholded.  Pixels  with  a  very  low  gradient  magnitude  may  then  be  rejected  before 
having  to  compute  d. 

4  Results 

The  algorithm  was  implemented  in  C  on  an  IBM  RS  6000  Model  530  workstation,  and 
tested  on  hundreds  of  indoor  images  obtained  by  our  mobile  robot.  The  predefined  3-D 
orientations  are  the  vertical  and  the  two  horizontal  orientations  perpendicular  to  each 
other  and  aligned  with  the  axes  of  our  building.  Figures  1  and  2  show  the  results  of 
line  extraction  for  one  image  in  a  sequence.  The  processing  time  is  only  2.2  seconds  for 
each  512  by  480  image.  Preliminary  timing  results  on  a  HP  730  desktop  workstation 
approach  only  a  second  of  processing,  from  the  intensity  image  to  the  list  of  categorized 
segments.  The  fast  speed  can  be  explained  partly  by  the  absence  of  multi-cycle  floating¬ 
point  instructions  from  the  line  orientation  equations,  when  properly  expressed. 

The  lines  are  not  broken  up  easily  by  a  noisy  gradient  orientation,  because  the  ori¬ 
entation  “buckets”  are  wide  and  centered  on  the  noiseless  gradient  orientation  for  each 
3-D  orientation  category.  The  output  quality  does  not  degrade  abruptly  with  high  im¬ 
age  noise,  provided  that  the  thresholds  for  local  gradient  orientations  are  loosened.  The 
sensitivity  to  different  thresholds  is  similar  to  that  of  the  Burns  algorithm;  a  single  set 
of  parameters  can  be  used  for  most  images.  A  few  misclassifications  occur  in  some  parts 
of  the  images,  but  are  marked  as  ambiguities. 

\Vc  have  compared  the  real  and  computed  3-D  orientation  of  1439  detected  segments 
from  eight  images  in  three  different  environments.  The  presence  of  people  in  some  scenes, 
as  well  as  noise  in  the  radio  transmission  of  images,  did  not  seem  to  generate  many  mis¬ 
classifications.  The  most  frequent  ambiguities  occurred  with  horizontal  segments  parallel 
to  the  optical  axis:  1.1  %  of  them  were  classified  as  possibly  vertical  in  3-D. 

5  Conclusion 

We  have  presented  a  new  algorithm  for  detecting  line  segments  in  an  image  of  a  3-D 
scene  with  known  prominent  orientations.  The  output  of  the  algorithm  is  particularly 
well  suited  for  further  processing  using  perceptual  organization  techniques.  In  partic¬ 
ular,  angular  relationships  between  segments  in  the  3-D  scene,  such  as  parallelism  or 
perpendicularity,  are  easily  verified.  Knowledge  of  the  3-D  orientation  of  segments  is  a 
considerable  advantage  over  the  traditional  2-D  perceptual  organization  approach.  The 
orientation  thresholds  of  the  2-D  perceptual  organization  systems  cannot  handle  a  sig¬ 
nificant  perspective  distortion  (such  as  the  third  orientation  category  in  Fig.  2).  The 
independence  from  the  perspective  distortion  brings  more  formal  angular  thresholds  to 


Fig.  1.  (a)  The  input  intensity  inaage,  and  (b)  the  2-D  segments 


Fig.  2.  The  line  segments  associated  with  each  3-D  orientation 


the  perceptual  organization  process.  By  using  the  3-D  orientation  at  the  lowest  image 
processing  level,  both  the  quality  and  speed  of  the  algorithm  were  improved.  The  ultimate 
benefits  of  this  approach  were  demonstrated  on  real  images  in  real  situations. 
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Abstract 

This  •paper  describes  new  algorithms  for  detecting 
and  interpreting  linear  features  of  a  real  scene  as  tm- 
aged  by  a  single  camera  on  a  mobile  robot.  The  low-- 
level  processing  stages  are  specifically  designed  to  in¬ 
crease  the  usefulness  and  the  quality  of  the  extracted 
features  for  a  semantic  interpretation.  The  detection 
and  interpretation  processes  provide  a  S-D  orientation 
hypothesis  for  each  S~D  segment.  This  in  turn  is  used 
to  estimate  the  robot's  orientation  and  relative  posi¬ 
tion  in  the  environment  and  to  delimit  the  free  space 
visible  in  the  image.  Next,  the  orientation  data  is  used 
by  a  motion  stereo  algorithm  to  fully  estimate  the  S-D 
structure  when  a  sequence  of  images  becomes  avail¬ 
able.  From  detection  to  S-D  estimation,  a  strong  em¬ 
phasis  IS  placed  on  real-world  applications  and  very 
fast  processing  with  conventional  hardware. 

1  Introduction 

This  paper  presents  a  new  approach  for  extracting 
semantically  significant  line  segments  from  monocu¬ 
lar  images,  for  estimating  a  mobile  robot’s  orientation 
and  relative  position,  and  for  identifying  key  objects 
associated  with  the  free  space,  such  as  walls  and  door¬ 
ways.  The  common  idea  is  to  design  all  the  stages  of 
image  interpretation,  including  the  lowest  image  pro¬ 
cessing  level,  such  that  they  provide  the  higher  stages 
with  the  most  semantically  useful  features. 

The  tasks  to  be  accomplished  by  the  robot  are  usu¬ 
ally  specified  in  high-level  semantic  terms,  such  as  “go 
down  the  hallway  and  go  through  the  last  door  to  the 

*Thi(  research  was  tupported  in  part  by  the  DoD  Joiat  Ser¬ 
vices  Electronici  Program  through  the  Air  Force  Office  of  Scien¬ 
tific  Research  ( AFSC)  Contract  P49620-89-C-004<t,  and  in  part 
by  the  Army  Research  Office  tmder  contract  DAAL03-91-G- 
0050. 


left.”  In  order  to  execute  this  task,  the  robot  must 
be  able  to  identify  the  objects  of  interest  (here,  hall¬ 
ways  and  doors)  in  its  perception  of  the  environment. 
One  approach  is  to  reconstruct  a  3-D  line  segment 
description  of  the  environment  of  the  robot  from  sev¬ 
eral  intensity  images  (see  for  example  [1]),  and  then 
to  match  groups  of  those  segments  to  selected  object 
models.  For  this,  several  3-D  segments  are  selected 
according  to  orientation  and  position  criteria  [9].  A 
3-D  hypothesis  of  an  object  is  generated  and  matched 
to  the  selected  segments.  The  process  is  repeated  for 
other  segments  and  other  hypotheses.  The  high-level 
semantic  interpretation  is  then  used  for  determining 
the  free  space  or  finding  objects  of  interest. 

Since  the  high-level  interpreter  is  looking  for  line 
segments  of  particular  orientations  in  3-D,  we  designed 
our  lower-level  image-processing  stages  to  take  advan¬ 
tage  of  this  information.  We  will  show  practical  ex¬ 
amples  of  how  this  top-down  information  can  benefit 
the  feature  extraction  stage  by  reducing  the  amount  of 
unwanted  features,  increasing  the  sensitivity  to  good 
features,  and  drastically  speeding  the  computation. 

Section  2  of  this  paper  discusses  the  a  priori  knowl¬ 
edge  of  a  few  prominent  3-0  orientations  for  lines  in 
the  scene.  Section  3  describes  a  very  fast  algorithm  to 
extract  useful  line  segments  from  a  single  image.  The 
algorithm  simultaneously  hypothesizes  the  3-D  orien¬ 
tation  of  segments.  Section  4  shows  how  the  robot’s 
orientation  in  the  scene  is  extracted  from  a  single  im¬ 
age,  and  how  the  results  of  Section  3  may  be  used 
for  a  first  semantic  interpretation  of  the  scene  with¬ 
out  the  3-D  depth  information.  Finally,  Section  5  ex¬ 
tends  these  results  to  a  sequence  of  monocular  images. 
A  Kalman  filter  is  used  for  a  recursive  estimatiou  of 
the  position  of  3-D  segments.  Unlike  purely  metrical 
approaches  for  the  recursive  estimation,  the  qualita¬ 
tive  information  derived  independently  for  each  image 
is  used  to  enhance  the  quality  and  usefulness  of  the 


reaulta  while  decreasing  the  computation  time.  The 
high-level  semantic  interpretation  of  the  resulting  3- 
O  maps  is  discussed.  Experimental  results  with  an 
indoor  mobile  robot  are  provided  and  analyzed. 

2  Motivation  and  assumptions 

2.1  Knowledge  of  prominent  3-D  orienta¬ 
tions 

Estimating  the  3-D  structure  of  a  scene  from  a  sin¬ 
gle  visual  image  is  impossible  without  certain  assump- 
t  ons  about  the  structure  and  the  projection  geometry. 
We  chose  to  concentrate  on  objects  which  have  paral¬ 
lel  lines  with  known  3-D  orientations  in  a  world  coor¬ 
dinate  system.  For  example,  in  indoor  scenes,  rooms 
and  hallways  usually  have  a  rectangular  structure,  and 
there  are  three  prominent  orientations  for  3-D  line  seg¬ 
ments;  one  vertical  and  two  horizontal  orientations 
perpendicular  to  each  other.  Outdoor  urban  scenes 
also  display  similar  characteristics.  In  this  paper,  any 
3-D  orientation  is  permitted,  as  long  as  it  is  given 
•o  the  algorithms.  Therefore,  more  complex  environ¬ 
ments,  SUCH  as  polygonal  buildings  with  angles  other 
than  90  degrees,  are  handled  as  well  if  these  angles 
are  known.  The  assumption  of  known  prominent  3-D 
orientations  is  consistent  with  the  goal  of  identifying 
objects  of  interest  for  a  mobile  robot,  such  as  walls, 
doorways,  or  entire  buildings. 

2.2  Orientation  of  the  camera  relative  to 
the  scene 

Vertical  lines  constitute  an  interesting  special  case 
for  two  reasons;  they  are  especially  common  in  man¬ 
made  scenes,  and  their  3-D  orientation  can  easily  be 
known  in  the  3-D  camera  coordinate  system.  If  a  2- 
axis  inclinometer  is  mounted  on  the  mobile  robot  and 
properly  calibrated,  a  3-D  vertical  vector  can  be  ex¬ 
pressed  in  the  3-D  coordinate  system  aligned  with  the 
2-D  image  coordinate  system.  Inexpensive  commer¬ 
cial  inclinometers  have  a  precision  better  than  0.01 
degrees.  Although  vibrations  and  calibration  errors 
may  corrupt  this  measurement,  it  can  serve  as  a  good 
estimate  of  the  camera’s  roll  and  pitch.  We  will  as¬ 
sume  that  either  the  robot  is  equipped  with  an  incli¬ 
nometer,  or  that  it  drives  on  an  approximately  planar 
I'orizontal  surface. 

We  estimate  the  third  angular  degree  of  freedom 
of  the  camera  from  odometer  readings  of  the  robot. 
Provided  that  the  odometer  is  constantly  corrected 


by  vision  (which  it  will  be  in  Section  4),  the  odome¬ 
ter  does  not  drift  without  bounds.  The  orientation  of 
the  camera  with  respect  to  the  scene  follows  from  the 
orientation  of  the  robot  and  the  eye/wheel  calibration 
data. 

2.3  Coordinate  systems  and  transforma¬ 
tions 

The  coordinate  systems  ,  as  shown  in  Figure  1,  are 
W  (the  World  coordinate  system,  with  a  vertical  z- 
axis),  R  (the  Robot  coordinate  system),  C  (the  Cam¬ 
era  coordinate  system),  and  P  (the  coordinate  system 
used  for  the  perspective  projection  on  the  retina). 

The  homogeneous  coordinate  transformation  ma- 
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Trc,  the  coordinate  transformation  matrix  from 
R  to  C,  needs  to  be  completely  determined  through 
eye/wheel  calibration.  Tcp,  the  perspective  projec¬ 
tion  matrix,  is  also  determined  through  calibration. 
Therefore,  we  have 
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where  u  and  v  represent  the  coordinates  in  pixels  of  a 
point  on  the  image  plane. 

2.4  Projecting  segments  of  known  3-D 
orientation 

In  this  section,  we  draw  inferences  about  the  likely 
3-D  orientation  of  the  line  segments  from  their  2-D 
projections  in  the  image  plane.  With  a  pinhole  per¬ 
spective  projection  model,  lines  parallel  to  each  other 
in  the  3-D  scene  will  converge  to  a  vanishing  point 
in  the  2-D  projection  (refer  to  Figure  2).  The  knowl¬ 
edge  of  vanishing  points  provides  useful  information 
about  the  structure  of  the  scene  and  about  the  cam¬ 
era’s  orientation  with  respect  to  the  scene  [2,  3,  10). 
In  particular,  if  the  orientation  of  the  camera  relative 
to  the  scene  is  known,  a  vanishing  point  can  be  com¬ 
puted  for  each  given  3-D  orientation  before  the  image 


I77‘) 


3  processed.  All  the  lines  that  have  a  given  orienta- 
ion  in  S-D  mu5<  pass  through  the  associated  vanishing 
point  when  projected.  Conversely,  if  a  line  does  not 
pass  through  a  vanishing  point,  it  cannot  have  the  3- 
D  orientation  associated  with  that  vanishing  point.  In 
practice,  if  a  line  does  pass  through  a  vanishing  point 
when  projected,  it  is  likely  to  have  the  associated  3-D 
orientation.  The  results  of  Section  3  will  show  how 
likely  this  is  in  practical  situations. 

To  summarize,  the  line  detection  algorithm  of  Sec¬ 
tion  3  knows  in  each  point  of  the  image  plane  the  ori¬ 
entation  that  a  projected  line  segment  would  have  if  it 
had  one  of  the  predefined  3-D  orientations.  Therefore, 
the  basic  idea  is  to  detect  the  2-D  segments  with  one 
of  the  possible  orientations,  and  mark  them  with  the 
associated  3-D  orientation  hypothesis. 
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Figure  1;  The  coordinate  systems 


To  vauihing  point  V, 


Figure  2:  The  eflfects  of  the  heading  on  segment 
orientations 


3  Detecting  segments  and  estimating 
their  3-D  orientation 

3.1  Overview  of  the  algorithm 

The  processing  can  be  outlined  as  follows  (see  |7] 
for  details): 

1.  Line  support  region  extraction:  compute  the  an¬ 
gle  between  the  intensity  gradient  at  each  pixel 
and  the  expected  direction  of  the  projection  of 
each  3-D  orientation-  Use  a  loose  threshold  to  al¬ 
low  for  noise  in  the  gradient  orientation.  Reject 
improper  pixels  and  3-D  orientations.  Using  the 
lormulation  of  [7],  the  entire  line  support  extrac¬ 
tion  is  reduced  to  8  additions  and  11  multiplica¬ 
tions  per  pixel  and  per  3-D  orientation. 

2.  Non-maximasuppression:  keep  only  the  local  gra¬ 
dient  maxima  along  the  estimated  perpendicular 
to  the  line. 

3.  Pixel  linking;  create  chains  of  pixels  using  a  par¬ 
tial  neighborhood  search  in  the  direction  of  the 
estimated  vanishing  points.  This  creates  noisy 
linear  chains. 

4.  Line  fitting:  perform  a  least-squares  fit  of  line  seg¬ 
ments  to  the  pixel  chains.  The  pixel  chains  whicli 
cannot  be  closely  approximated  with  a  line  seg¬ 
ment  are  broken  recursively  into  smaller  chains. 

5.  Global  orientation  check;  compute  the  match  be¬ 
tween  each  line  and  each  3-D  orientation,  like  in 
the  line  support  extraction  step  but  with  a  much 
tighter  threshold. 

If  the  robot’s  a  priori  roll,  pitch  and  heading  are 
very  uncertwn,  the  lines  will  be  extracted  with  loose 
thresholds,  the  true  angles  will  be  estimated,  and  the 
algorithm  can  then  be  run  again  with  tight  thresholds 
for  the  correct  categorization. 

3.2  Results 

The  algorithm  was  implemented  in  C  on  an  HP  730 
workstation,  and  tested  on  hundreds  of  indoor  images 
obtained  by  our  mobile  robot  (see  [8]  for  a  video  pre¬ 
sentation  of  the  results).  The  predefined  3-D  orien¬ 
tations  are  the  vertical  and  the  two  horizontal  orien¬ 
tations  perpendicular  to  each  other  and  aligned  with 
the  axes  of  our  building.  Figures  3  to  5  show  the  re¬ 
sults  of  line  extraction  for  one  image  in  a  sequence 
(The  intensity  image  is  deformed  because  the  camera 
is  equipped  with  a  6  mm  wide-angle  lens,  as  desctil  ed 


in  Section  5).  The  processing  time  is  only  1-2  seconds 
for  each  512  by  480  image,  from  the  intensity  image  to 
the  list  of  categorized  segments.  \  he  fast  speed  can  be 
explained  partly  by  the  absence  of  multi-cycle  floating¬ 
point  instructions  from  the  line  orientation  equations, 
when  properly  expressed.  The  lines  are  not  broken  up 
easily  by  a  noisy  gradient  orientation,  and  the  output 
quality  does  not  degrade  abruptly  with  high  image 
noise,  provided  that  the  thresholds  for  local  gradient 
orientations  ate  loosened.  A  few  misclassifications  oc¬ 
cur  in  some  parts  of  the  images,  but  are  marked  as 
ambiguities. 


Figure  3:  The  input  intensity  image 
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Figure  4:  The  line  segments  associated  with  each 
3-D  orientation 


Table  1  compares  the  real  and  computed  3-D  orien¬ 
tation  of  1439  detected  segments  f.om  eight  images  in 
three  different  environments.  The  presence  of  people 
’n  some  scenes,  as  welt  as  noise  in  the  radio  transmis- 
•sion  of  images  did  not  seem  to  generate  many  mis- 
classifications.  The  percentages  in  each  column  sum 
to  more  than  100  %  because  of  the  multiple  classifica- 
t;on  of  ambiguous  segments. 


Figure  0  All  of  the  2-D  scgmi'nts 
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Table  1:  Hypothesizing  3-D  orientations;  suc¬ 
cess  and  error  rates 


4  Using  information  from  a  single  im¬ 
age 

4.1  Computing  the  robot’s  orientation 

4.1.1  Algorithm 

In  this  section  we  show  how  the  robot's  roll,  pitch 
and  heading  may  be  computed  a  posterton  from  the 
2-D  segments  extracted  in  Section  3  To  achieve  the 
best  precision,  each  angle  should  be  evaluated  from 
the  vanishing  point  which  varies  the  most  with  that 
angle  We  begin  with  the  roll,  which  is  best  derived 
from  the  3-D  vertical  lines  if  the  camera  is  looking  ap¬ 
proximately  forward  Because  of  our  definition  of  the 
heading  (a  rotation  around  the  vertica!  axis  of  W),  the 
vanishing  point  a.ssociafed  with  the  vertical  lines  does 
not  depend  on  the  heading,  if  all  the  2-D  segments 
classified  a.s  3-D  vertical  are  parallel  m  the  image,  the 


vanishing  point  lies  at  infinity,  the  pitch  p  is  zero,  &nd 
the  roll  r  is  the  2-0  orientation  of  the  segments  in 
the  image.  Otherwise,  we  hrst  estimate  the  location 
(uv,tiv)  of  the  vanishing  point  by  a  weighted  least- 
squares  method.  Let  be  the  element  i,j  of  the 
matrix  Trp.  After  algebraic  manipulations,  we  find: 


r  =  arctan 
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Figure  6;  The  path  of  the  robot  (a)  as  estimated 
by  odometry,  and  (b)  after  correction  by  vision 


The  pitch  of  the  robot  may  also  be  computed  from 
the  vanishing  point  of  vertical  lines,  but  for  a  better 
precision,  we  use  another  vanishing  point  more  depen¬ 
dent  on  pitch  and  heading.  In  our  implementation,  we 
use  the  vanishing  point  which  lies  closest  to  the  center 
of  the  image.  We  obtain  the  pilch  and  the  heading 
with  a  method  identical  to  the  computation  of  roll 
described  above 

4.1.2  Results 

The  quadity  of  the  results  depends  on  the  error  on  the 
vanishing  point  extraction  and  the  precision  of  calibra¬ 
tion.  To  illustrate  the  performance  of  heading  compu¬ 
tation  from  individual  images,  we  plotted  the  path  of 
the  robot  around  a  building  floor.  In  Figure  6,  the 
heading  extracted  by  the  vision  algorithm  was  com¬ 
bined  to  the  translation  given  by  odometry  to  estimate 
the  path  of  the  robot.  At  the  beginning  and  end  of  the 
experiment,  the  robot  was  approximately  aligned  with 
the  center  of  a  hallway.  The  robot  stopped  about  five 
meters  before  the  departure  point.  The  total  traveled 
distance  w<is  125  meters,  and  97  images  were  used  to 
compute  the  headings.  The  algorithm  seemed  unaf¬ 
fected  by  people  moving  in  the  field  of  view  and  by  a 
significant  noise  due  to  the  transmission  of  the  images 
by  radio.  The  processing  time  for  a  postehori  heading 
estimation  is  negligible  compared  to  the  time  needed 
for  line  extraction. 

4.2  Image  interpretation 

Interpreting  a  single  segment  image  is  necessary 
*hen  no  3-D  segment  information  is  available.  This 
iccurs  for  the  first  frames  of  a  motion  stereo  sequence, 
ir  when  the  baseline  of  a  stereo  setup  is  too  small  com- 
lared  to  the  depth  of  the  scene.  Ideally,  interpreting  a 
ingle  segment  image  should  not  be  too  different  from 


interpreting  a  3-0  segment  map  to  maintain  consis¬ 
tency  when  the  latter  representation  becomes  avail¬ 
able.  Identifying  objects  by  searching  for  segments  of 
a  particular  3-D  orientation  is  a  good  step  to  achieve 
this  consistency. 

The  inlerpretatioD  of  the  individual  segments  in  a 
single  2-D  image  requires  a  pnori  knowledge  about 
the  type  of  scene.  It  can  be  achieved  by  heuristic 
methods.  The  knowledge  of  the  orientation  of  3-D 
segments  is  particularly  well  suited  to  finding  the  floor 
and  the  walb  in  indoor  images.  Extracting  the  floor 
is  essential  for  map-making,  for  determining  the  free 
apace,  and  for  computing  the  position  of  the  robot 
relative  to  the  visible  environment.  In  most  cases, 
the  outline  of  the  floor  can  be  described  entirely  by 
segments  of  predefined  horizontal  3-D  orientation.  \ 
heuristic  method  can  extract  an  approximate  outline 
of  the  floor  from  a  single  segment  image.  In  real  ap¬ 
plications,  the  floor  may  not  be  perfectly  extracted, 
but  this  first  approximation  can  be  corrected  after  a 
sequence  of  images  becomes  available,  by  estimating 
recursively  the  3-D  position  of  the  segments. 


5  Estimating  the  position  of  3-D  seg¬ 
ments 

5.1  Introduction 

The  recursive  estimation  of  3-D  segments  from  a  se¬ 
quence  of  images  is  the  next  logical  step  toward  scen^ 
understanding.  It  aims  at  estimating  the  depth  in¬ 
formation  that  is  absent  from  2-D  images  Existing 
approaches  include  (1,  4,  5],  to  name  only  a  few, 
Vieville  and  Faugeras  (11)  concentrate  on  estimating 
motion  and  structure  from  monocu/or  segment  images 
and  present  results  on  the  reconstruction  of  a  calibra¬ 
tion  pattern.  Jezouin  and  Ayache  [6]  review  different 


segment-tracking  techniques  using  monocular  images 
and  present  results  on  3-D  reconstruction  from  syn¬ 
thetic  aerial  images. 

Applications  in  autonomous  mobile  robots  imply 
the  ability  to  deal  with  real  indoor  scenes  while  us¬ 
ing  limited  computation  resources.  As  in  the  other 
sections  of  this  paper,  we  aim  at  providing  useful  fea¬ 
tures  for  a  semantic  interpretation  of  the  environment 
of  the  robot.  We  only  consider  the  segments  which 
have  been  extracted  by  the  algorithm  of  Section  3. 
We  will  show  how  this  choice  benefits  the  robustness 
and  speed  of  the  recursive  estimation  process. 

5.2  Preliminary  observations 

A  3-D  segment  should  be  observed  from  very  dif¬ 
ferent  angles  to  be  accurately  reconstructed.  Two  ap¬ 
proaches  can  be  used  to  achieve  this  with  a  monoculu 
camera  The  first  one,  active  vision,  pans  and  tilts  the 
camera  to  keep  some  interesting  features  in  the  field 
of  view.  The  second  approach,  which  we  chose,  con¬ 
sists  of  fitting  the  camera  with  a  wide-angle  lens.  A 
3-D  segment  is  usually  first  detected  when  far  ahead 
of  the  robot,  but  as  the  robot  drives  closer,  the  2-D 
projection  moves  to  one  side  of  the  focus  of  expansion. 
A  short  focal  length  will  enable  the  3-D  segment  to  re¬ 
main  in  the  field  of  view  long  enough  for  an  accurate 
reconstruction.  The  drawback  to  using  a  wide-angle 
lens  is  geometric  distortion.  The  first  processing  step 
is  therefore  to  correct  this  distortion. 

The  3-D  segments  which  are  parallel  to  the  direc¬ 
tion  of  motion  cannot  be  properly  reconstructed  with¬ 
out  consid'-ring  their  endpoints.  The  experience  of  all 
researchers  using  real  images  indicates  that  endpoints 
are  unreliable,  because  2-D  segments  may  be  broken 
or  extended  (due  to  specular  reflection,  for  example). 
Therefore,  we  wiii  not  use  endpoints  to  estimate  the 
parameters  of  infinite  3-D  lines.  There  is  no  justifi¬ 
cation  for  wasting  computation  to  estimate  features 
that  will  not  help  the  interpretation  of  the  scene.  In 
practice,  we  will  not  attempt  to  reconstruct  the  3-0 
segments  parallel  to  the  direction  of  motion.  Those 
segments  are  sent  to  the  semantic  interpreter  with  an 
estimate  of  orientation  but  not  of  position,  unless  the 
robot  changes  its  course.  This  contrasts  with  other 
approaches  that  do  send  an  estimate  of  the  position 
and  orientation  of  those  segments,  both  with  a  huge 
uncertainly. 

Figure  7  presents  a  block  diagram  of  the  recur¬ 
sive  estimation  of  3-D  segments.  As  in  Section  4, 
we  combine  translations  from  odometry  to  rotations 
from  vanishing  points  to  estimate  the  motion  of  the 
robot.  Because  the  rotations  are  obtained  from  each 


2-D  image  with  an  excellent  accuracy,  we  do  not  in¬ 
troduce  any  uncertainty  on  their  parameters.  Trans¬ 
lations,  however,  rely  partly  on  odometry.  Therefore, 
we  model  the  location  of  the  robot  in  the  world  coor¬ 
dinate  system  by  a  3-D  vector  f*  and  its  covariance 
matrix  E(iitf)  under  the  Gaussian  noise  assumption. 


Figure  7.  Recursive  estimation  of  3-D  segments 

5.3  Representation  of  lines 

Since  the  algorithm  of  Section  3  already  provides 
the  most  likely  3-D  line  orientation,  we  only  estimate 
two  positional  parameters  for  each  3-D  line.  Let  us 
define  a  coordinate  system  Ln  for  each  predefined  3-D 
orientation  n  as  follows:  Ln  is  a  rotation  of  W  so  that 
the  3-D  orientation  n  is  parallel  to  the  z  axis  of  L„. 
The  Tl.w  matrices  are  all  known  by  definition.  The 
state  vector  of  a  tracked  3-D  line  is  then  made  of  the 
first  two  components  of  the  coordinates  in  Ln  of  any 
point  of  the  line 
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Because  we  are  not  using  slereovision,  a  problem 
ajtses  when  a  2-D  segment  is  acquired  for  the  first 
time.  Our  approach  is  to  model  it  by  a  state  vec* 
tor  with  arbitrary  depth  and  a  very  large  uncertainty 
in  depth.  After  repeated  observations  from  different 
angles,  the  uncertainty  ellipsoid  associated  with  the 
state  vector  will  shrink  around  the  best  estimate  of 
the  position. 

The  measurement  j/t ,  of  a  2-D  line  in  frame  k  given 
its  associated  3-D  orientation  is  defined  as  follows:  if 
the  line  is  “rather  horizontal,”  it  can  be  described  in 
the  image  plane  by  v  =  au  -f-  y*,,-  with  some  parame¬ 
ter  a.  If  the  line  is  “rather  vertical,”  y*  ,  is  such  that 
u  =  at'  -b  yt  i-  Deciding  whether  a  line  is  rather  hori¬ 
zontal  or  vertical  is  based  on  the  estimated  3-D  orien¬ 
tation,  so  that  singularities  in  the  representation  are 
avoided  yet  consistency  across  frames  is  meuntained. 
The  explicit  state  and  measurement  equations  for  the 
Kalman  filter  are  not  derived  here  for  lack  of  space. 

5.4  Implementation  and  results 

The  implementation  benefits  from  hypothesizing 
the  3-D  orientation  of  lines  before  estimating  their  3-D 
position  in  the  following  ways: 

•  Matching  is  attempted  only  between  segments  of 
the  same  orientation  in  3-D.  In  practice,  other 
implementations  of  3-D  segment  recursive  esti¬ 
mators  use  arbitrary  2-D  orientation  “buckets” 
to  achieve  the  same  purpose  By  matching  seg¬ 
ments  of  given  3-D  orientations,  we  have  a  more 
formal  approach  to  the  problem  of  limiting  possi¬ 
ble  matches. 

•  No  computation  is  wasted  on  segments  which  can¬ 
not  be  reconstructed  because  they  are  pareillel  in 
3-D  to  the  direction  of  motion. 

•  The  problem  of  singularities  in  the  representation 
(see  [6]  for  example)  is  avoided:  although  there  is 
still  a  need  for  different  line  parameterizations, 
the  appropriate  one  needs  to  be  determined  only 
once  for  each  segment  being  tracked. 

•  Traditional  approaches  have  difficulties  dealing 
with  moving  objects  and  a  moving  czunera.  Large 
moving  objects  will  corrupt  the  3-D  map  of  the 
environment  and  yield  incorrect  motion  estimates 
if  the  egomolion  is  estimated  from  the  image.  By 
tuning  the  line  extractor  to  certain  orientations  of 
interest,  it  is  unlikely  that  an  unexpected  moving 
object  could  durably  display  misleading  features. 
Therefore,  the  identification  of  the  visible  objects 


of  interest  will  not  be  significantly  h  mpered  by 
large  moving  objects.  This  is  extremely  impor- 
trmt  in  real-world  situations. 

Several  approaches  are  possible  for  tracking  seg¬ 
ments  screws  images.  If  a  conventional  token-tracker 
is  used  [5],  matching  is  necessary  only  between  seg¬ 
ments  with  the  same  3-D  orientation.  The  results  we 
are  presenting  here  have  been  obtained  with  manual 
token-tracking.  We  are  currently  working  on  a  ne  v 
type  of  token-tracker  that  relies  on  topological  and 
semantic  information  to  eliminate  incorrect  matches. 

Figure  8  shows  the  vertical  segments  in  a  top  view 
of  a  corridor,  as  estimated  after  1,  4,  7,  and  10  images 
in  a  sequence.  The  robot  moved  nine  meters  during 
the  sequence,  or  about  a  quarter  of  the  length  of  the 
corridor  (the  motion  is  from  left  to  right  in  the  fig¬ 
ure).  The  segments  visible  for  the  first  time  arc  given 
an  arbitrary  depth  and  a  very  large  uncertainty  in 
depth.  The  width  of  the  corridor  at  the  location  of 
the  robot  was  estimated  with  an  error  varying  from 
0.4%  to  4.7%. 


Figure  8:  Top  view  of  a  corridor  after  1,  4,  7,  and 
10  images  (uncertainty  of  vertical  segments) 


5.5  High-level  semantic  interpretation 

The  semantic  interpretation  of  the  3-D  segment 
maps  should  be  consistent  with  the  interpretation  of 
single  images  since  the  3-D  information  is  not  present 
or  accurate  for  some  segments.  The  search  for  seg¬ 
ments  associated  with  objects  of  interest  depends  pri¬ 
marily  on  3-D  segment  orientation  and  2-D  image 
topology.  The  position  of  3-D  segments  is  used  in  a 
later  stage  to  verify  object  hypotheses  and  to  estimate 
their  range.  The  reflection  of  segments  in  the  floor,  of¬ 
ten  a  problem  with  2-0  interpretation,  is  handled  by 
clipping  the  3-D  segments  below  the  floor  plane. 


Vertical  segmenta  in  the  top  view 
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Table  2;  Comparing  estimated  door  frame  edges 
to  their  model 


The  semantic  interpretation  of  particular  objects  is 
application-dependent.  To  illustrate  how  the  recursive 
estimation  of  oriented  segments  can  be  used  by  an  in¬ 
terpreter,  we  will  describe  the  detection  of  the  door 
‘'rar  .cs  in  the  corridor  of  Figure  8.  The  door  frames 
are  aligned  with  the  axes  of  the  building,  therefore  seg¬ 
ments  can  be  selected  based  on  their  3-D  orientation 
and  2-D  topology.  To  verify  a  door  frame  hypothesis, 
the  two  selected  vertical  segments  are  checked  for  their 
distance  and  the  3-D  location  of  their  endpoints.  The 
segments  should  also  belong  to  a  3-D  plane  formed  by 
two  predefined  3-D  orientations.  Since  the  positions 
of  the  3-D  segments  are  uncertain,  the  Mahalanobis 
distance  is  used  to  adapt  the  thresholds. 

As  an  illustration.  Table  2  analyzes  the  vertical 
edges  of  the  door  frames  shown  in  the  last  top  view 
of  Figure  8  Ideally,  the  two  vertical  segments  should 
e  iul6  mm  apart  and  the  angle  of  the  door  frame 
in  the  scene  should  be  zero.  The  last  two  doors  are 
about  9  meters  ahead  of  the  robot  and  the  3-D  po¬ 
sition  of  their  segments  is  still  very  uncertain.  The 
number  of  images  in  which  the  third  and  fourth  door 
frames  were  visible  explains  the  high  precision  of  their 
reconstruction. 


6  Conclusion 

We  have  presented  a  new  approach  to  the  interpre- 
Utio  of  monocular  line  images  for  the  navigation  of  a 
mobile  robot.  We  described  algorithms  for  interpret¬ 
ing  single  images  and  for  recursively  updating  inter- 
prclati  ns  with  monocular  sequences  of  images.  The 
quality  and  usefulness  of  the  results  of  each  processing 
stage  justified  the  practical  constraint  placed  on  the 
problem:  the  a  pnlon  knowledge  of  a  few  prominent 
3-D  segment  orientations.  This  constraint  is  reason¬ 


able  since  even  humans  get  disoriented  when  placed 
in  a  tilted  environment,  despite  having  our  own  “in¬ 
clinometers”  in  our  inner  ears.  By  reasoning  in  terms 
of  what  features  the  high-level  interpretation  stage  will 
actually  use,  we  were  able  to  design  lower-level  algo¬ 
rithms  for  better  and  faster  processing.  The  ultimate 
benefits  of  this  approach  were  demonstrated  on  real 
images  in  real  situations,  using  conventional  comput¬ 
ing  hardware,  and  with  image  acquisitions  every  few 
seconds. 
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Abstract— k  digital  spectral  method  to  evaluate  second-order 
distortion  of  a  nonlinear  system,  which  can  be  represented  by 
Voiterra  kernels  up  to  second  order  and  which  is  subjected  to 
a  random  noise  input,  is  discussed.  The  importance  of  depar¬ 
tures  from  the  commonly  assumed  Gaussian  excitation  is  in¬ 
vestigated.  The  Hinich  test,  which  makes  use  of  the  statistical 
properties  of  bispectrum,  is  shown  to  be  an  appropriate  test  for 
orthogonality  in  the  system  identification  of  a  second-order 
Voiterra  system.  Tests  for  Gaussianity  of  two  important 
sources,  which  are  commonly  used  for  Gaussian  inputs  in  non¬ 
linear  system  identification  are  presented:  1)  commercial  soft¬ 
ware  routines  for  simulation  experiments  and  2)  noise  genera¬ 
tors  for  practical  experiments.  The  deleterious  effects  and 
misleading  results  of  assuming  a  Gaussian  input  when,  in  fact, 
it  is  not,  are  demonstrated.  The  random  input  method  to  eval¬ 
uate  (in  terms  of  the  second-order  Voiterra  kernels  and  distor¬ 
tion  factors)  the  second-order  distortion  of  a  nonlinear  system 
is  compared  with  the  sine-wave  input  method  using  both  sim¬ 
ulation  and  experimental  data.  The  proposed  approach  is  uti¬ 
lized  to  model,  identify,  and  quantify  the  linear  response  and 
second-order  distortion  products  of  a  loudspeaker  in  the  low 
frequency  band  by  using  higher  order  coherence  spectra. 


I.  Introduction 

DISTORTIONS  due  to  electromechanical  effects  in  the 
reproduction  component  are  becoming  more  impor¬ 
tant  as  the  state  of  the  art  in  audio  engineering  continues 
to  progress.  The  harmonic  and  intermodulation  distor¬ 
tions,  which  are  most  significant  in  high-fidelity  systems, 
usually  result  from  nonlinear  transfer  characteristics  in 
some  component.  The  measurement  of  harmonic  and  in¬ 
termodulation  distortions,  which  are  the  major  topic  of 
this  paper,  can  be  conducted  with  tone  generators  and  fil¬ 
ters.  For  example,  special  instruments  that  measure  har¬ 
monic  distortion  and  intermodulation  distortion  in  ampli¬ 
fiers  and  other  audio  components  have  been  developed  [IJ. 
A  harmonic  distortion  analyzer  includes  a  tone  generator 
plus  a  high-pass  filter,  which  suppresses  the  fundamental 
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frequency  but  allows  the  overtones  to  be  measured.  Har¬ 
monic  distortion  (HD)  is  expressed  as  a  percentage  [2]; 

_  sum  of  power  in  harmonics  2  and  up 

%  HD  =  - - ; - -  X  1(X)%. 

total  power 

To  measure  intermodulation  distortion,  a  signal  having 
two  frequencies  /,  and  fi  is  supplied  to  the  component  un¬ 
der  test.  Then  the  output  from  the  component  is  fed  into 
a  high-pass  filter,  giving  a  waveform  which  consists  of 
the  high  frequency  carrier  and  its  low  frequency  modu¬ 
lation.  The  percentage  of  intermoduiation  distortion 
(IMD)  is  given  by 

„  amplitude  of  modulation 

%  IMD  =  — - — ; - ^ -  X  I009t. 

amplitude  of  earner 

In  contrast  to  the  relatively  common  sine-wave  input 
approach  to  measure  distortion,  this  paper  describes  a  dig¬ 
ital  higher  order  spectral  technique  to  evaluate  distortion 
for  a  nonlinear  time-invariant  component  which  can  be 
represented  by  Voiterra  kernels  and  'vhich  is  subjected  to 
broad-band  random  inputs.  The  advantages  of  the  random 
input  approach  are:  a)  one  does  not  need  to  repeat  the 
same  experiments  by  changing  frequencies  of  oscillators 
and  filters  to  measure  nonlinear  distortions  over  the  band 
of  frequencies  of  interest,  which  is  a  tedious  and  time- 
consuming  job  in  the  sinusoidal  input  approach,  b)  band- 
limited  noise  is  a  closer  approximation  to  actual  input  sig¬ 
nals,  e.g.,  music  and  speech,  and  c)  we  can  quantify  dis¬ 
tortion  over  the  entire  frequency  band  using  higher  order 
coherence  spectra. 

The  Voiterra  series  approach  [3],  where  the  nonlinear 
response  is  expressed  by  a  sum  of  linear,  quadratic,  and 
higher  order  functionals,  has  recently  attracted  a  consid¬ 
erable  amount  of  interest  in  the  systems  engineering  area 
since  many  physical  systems  exhibiting  inherent  nonlin¬ 
ear  characteristics  can  be  studied  and  modeled  by  this  ap¬ 
proach  [4],  [5).  When  one  attempts  to  use  the  Voiterra 
series  to  describe  a  nonlinear  system,  the  system  model¬ 
ing  problem  reduces  to  the  measurement  of  the  Voiterra 
kernels.  A  relatively  common  assumption  underlying  this 
approach  involves  the  fact  that  the  input  is  assumed  to  be 
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a  random  waveform  which  possesses  Gaussian  statistics, 
an  assumption  which  allows  a  substantial  simplification 
of  the  relevant  mathematics  (6),  [7],  The  assumption  of 
Gaussian  excitation  is  also  utilized  in  many  technological 
areas,  for  example,  the  nonlinear  response  of  offshore 
structures  and  ships  to  random  seas  [8).  In  acoustics,  ex¬ 
periments  have  often  been  performed  under  the  assump¬ 
tion  that  the  random  noise  generators  used  for  an  input 
signal  have  Gaussian  statistics,  see.  for  example,  (9j  and 
[2].  However,  we  will  show  using  Hinich's  Gaussianity 
test  [lOj  that  commonly  used  random  inputs,  such  as  that 
provided  by  a  noise  generator,  cannot  automatically  be 
regarded  as  sufficiently  Gaussian  for  nonlinear  identifi¬ 
cation  studies,  even  though  the  one-dimensional  proba¬ 
bility  density  function  of  the  input  is  approximately 
Gaussian  with  a  small  skewness.  We  illustrate  this  point 
by  utilizing  the  Hinich  procedure  to  test  for  Gaussianity 
of  two  important  sources,  which  are  commonly  used  as 
Gaussian  inputs  in  nonlinear  sy.stem  identification;  1) 
commercial  software  routines  for  simulation  experiments, 
and  2)  noise  generators  for  practical  experiments. 

Recently  Kim  and  Powers  [ll]  presented,  without  as¬ 
suming  particular  statistics  for  the  input  (i.e.,  the  ap¬ 
proach  is  valid  for  both  Gaussian  and  non-Gaussian  in¬ 
puts),  a  practical  digital  method  of  estimating  linear  and 
quadratic  transfer  functions  (the  first-  and  second-order 
Volterra  kernels  in  the  frequency  domain)  of  a  nonliner 
time-invariant  system,  which  may  be  described  by  a  Vol¬ 
terra  series  up  to  second  order.  Furthermore,  the  degree 
of  linearity  and  nonlinearity  (quadratic  in  this  case)  as  a 
function  of  frequency  was  described  by  generalizing  the 
concept  of  coherence  spectra.  In  this  paper,  we  quantify 
the  second-order  nonlinear  distortion  of  audio  compo¬ 
nents  in  terms  of  coherence  spectra  at  each  frequency  [12| 
as  well  as  second-order  Volterra  kernels  at  sum  and  dif¬ 
ference  frequencies.  In  order  to  compare  the  random  input 
method  with  the  classical  two-tone  input  method,  we  es¬ 
timate  the  quadratic  transfer  function,  which  accounts  for 
the  second-order  harmonic  and  intermodulation  distor¬ 
tion,  by  both  methods  and  compare  them.  We  also  dem¬ 
onstrate  the  deleterious  effects  and  the  misleading  results 
of  assuming  a  Gaussian  input  when,  in  fact,  it  is  not  suf¬ 
ficiently  Gaussian  (13],  Finally,  the  approach  is  utilized 
to  model,  identify,  and  quantify  the  linear  response  and 
second-order  distortion  products  of  a  loudspeaker  at  low 
frequency. 

After  the  problem  definition  and  preliminaries  are  given 
in  Section  II,  the  importance  of  taking  into  account  de¬ 
partures  from  Gaussianity  in  the  excitation  is  discussed  in 
Section  III.  Then,  we  describe  the  approach  to  quantify¬ 
ing  nonlinear  distortion  with  digital  higher-order  coher¬ 
ence  spectra  in  Section  IV.  Bichromatic  (two-tone)  versus 
random  input  estimates  for  second-order  distortion  are  de¬ 
scribed  in  Section  V.  In  Section  VI,  we  present  the  results 
of  testing  a  noise  generator  output  for  Gaussianity  and 
present  experimental  results  of  applying  this  approach  of 
this  paper  to  quantify  second-order  distortion  of  a  loud¬ 
speaker.  A  summary  is  included  in  Section  VII. 


II.  Problem  Defi.nition  and  Preliminaries 
Consider  a  single-input/single-output  nonlinear  system 
which  can  be  described  by  the  Volterra  series  up  to  order 
two:  let  y(r)  be  the  output  and  xU)  the  input;  the  system 
is  represented  by 

y(t)  =1  hfit  -  t)x(t)  dr 

Ob  ^  OD 

1  ^i(r  -  T|,  r  ~  T2)x{ry)xiT>)  dr^ 

-  OD  J  -  as 

(1) 

where  hiir)  and  ti)  represent  the  first-order  and 

second-order  Volterra  kernels,  respectively.  In  the  trans¬ 
form  domain,  (1)  may  be  written 

n/)  =  H,{f)X(f)  -E  f  H2{f„f2)X{f,)X{f2) 


■  f'U  -  L  -  h  ^df.dh  (2) 


where  X( /)  and  Y{  f)  are  the  Fourier  transforms  of  jr(/) 
and  >’(/),  respectively.  6(/)  is  the  Dirac  delta  function. 
The  linear  transfer  function  //,(/),  which  can  be  ex¬ 
pressed  in  polar  form  as  |A/,  (/)(  exp  Iy<^i(/)],  is  the 
first-order  frequency  domain  Volterra  kernel.  The  quad¬ 
ratic  transfer  function  //:(/|.  /:)  =  IHjI/,,  /j)!  exp 
/o/:)l  in  polar  form,  is  the  two-dimensional  Fou¬ 
rier  transform  of  /i;(fi.  f;).  The  quadratic  transfer  func¬ 
tion  can  always  be  written  in  a  symmetric  form,  Hjl/. 
fj)  =  //;( ^,/).  without  loss  of  generality  |3].  Since,  in 
general,  ( /| , /; )  will  not  be  equal  to  H;  ( /t ,  -/^ ),  dif¬ 
ferent  values  of  distortion  at  different  frequencies  are  di¬ 
rectly  reflected  in  this  second-order  Volterra  kernel. 

The  second-order  factor  of  intermodulation  distortion 
(D,m2)  at  frequency  /i  ±  /i  is  defined,  in  decibels,  as 


Am2  =  20  log  [IM2]  =  20  log 


^'o(/l  ±/2) 
M/l) 


[dB] 


(3) 


where  Vo(  /,  ±  /?)  is  the  output  voltage  at  the  frequency 
f\  ±  /:  due  to  distortion  and  Po(  /i )  is  the  output  voltage 
at  frequency /,  due  to  the  linear  nature  of  the  system  under 
test.  This  term  may  be  expressed  in  terms  of  the  fre¬ 
quency  domain  Volterra  kernels  in  the  form  (5) 


-  20  log 


^2  ( /i '  ±  /; ) 
«i(/i) 


[dBl  (4) 


where  Vt  is  the  amplitude  of  both  input  voltages  (v,{t)  = 
P/  cos  2x/|  t  -E  V/  cos  2irf2t).  Likewise,  the  second-order 
harmonic  distortion  factor  which  is  defined  as 


Dh2  =  20  log  [//2|  -  20  log 


^^o(2/) 

KAf) 


[dB)  (5) 


may  be  expressed  in  terms  of  the  Volterra  kernels  as 


20  log 


2 


H2(f.f) 

W,(/) 


[dB] 


(6) 
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where  V,  is  the  amplitude  of  the  input  voltage  =  Vj 
cos  2irj^).  Thus,  the  second-order  intermodulation  distor¬ 
tion  factor  D]M2  and  harmonic  distortion  factor  of  a 
component  under  test  can  be  obtained  easily  at  various 
frequencies,  if  the  linear  transfer  function  and  quadratic 
transfer  function  of  the  component  can  be  estimated 
simultaneously  at  those  frequencies. 

Here  we  consider  a  general  technique  for  analyzing  the 
harmonic  and  intermodulation  distortion  of  a  nonlinear 
system  with  a  random  noise  input,  instead  of  sinusoidal 
inputs.  As  indicated  in  earlier  paragraphs,  the  approach 
rests  upon  determination  of  the  Volterra  kernels.  Thus,  in 
this  paper,  we  focus  on  the  determination  of  the  kernels 
and  the  quantification  of  the  goodness  of  the  resulting 
Volterra  model  through  higher  order  coherence  spectra. 
In  general,  one  can  express  (2)  using  the  discrete  form  as 

T(/„)  =  H,(f„)X{f„) 

+  E  H.(f,fj)X(f,)X(fj)  +  e(f„)  (7) 

t  *-j  =  m 


where  X(  f„ )  and  Y(  f„ )  are  the  V-point  discrete  Fourier 
transforms  of  the  input  and  the  output,  respectively.  ((f„) 
is  the  difference  between  the  model  output  Tf  f„  )n,odei  (the 
first  two  terms  on  the  RHS  of  (7))  and  the  true  output 
Y(  f„)  at  frequency  mAf  (A/  =  I /NT,  T:  sampling  pe¬ 
riod).  The  symbol  denotes  the  Nyquist  frequency  as¬ 
sociated  with  the  sampling  of  the  time  domain  signal. 
Since  (7)  is  linear  with  respect  to  the  transfer  functions, 
one  can  obtain  an  optimum  estimation  of  the  transfer 
functions  by  minimizing  the  mean-square  error 
f(|e(  To  simplify  notation,  X( /„)  will  be  denoted 

as  X(m),  Y(  f„)  as  Y(m),  etc.  To  find  the  optimum  solu¬ 
tion.  we  consider 

dHiim) 


3£Ik(m)p) 


i  +  j  -  m 


which  yields  the  following  two  equations: 
ElX*im)Y(m)]  =  H^{m)E[\X{m)\^] 


(8) 


-t-  E  H2{i,j)E{X*{m)X{i)X{j)] 

i~j=m 

(9) 


£{X*(k)X*(l)Y(m)] 

=  H,(m)ElX*(k)X*(l)X(m)] 

+  E  h:U,J)ElX*(k)X*(l)X(i)X(J)]  (10) 

I  =  m 

where  *  denotes  the  complex  conjugation.  In  (9)  and  (10), 
ElX*(m)  Y(m)]  is  the  crosspower  spectrum  between  input 
and  output  £[  |X(m)|^]  is  the  autopower  spectrum 

of  the  input  S„(m).  ElX*{m)X(i)X(j)]  is  the  complex 
conjugate  of  the  auto  bispectrum  of  the  input  y), 

and  E{X*(k)X*(l)  Y(m)]  is  the  cross  bispectrum  (where  k 
+  I  =  m)  between  input  and  output  By^j,(k,  /).  Note  that 


the  fourth-order  spectral  moment  appearing  on  the  right- 
hand  side  of  (10)  becomes  zero  when  k  +  I  ^  i  j  =  m 
because  of  the  propenies  of  higher  order  spectral  mo¬ 
ments  and  thus  (10)  is  considered  only  foi  k  +  I  =  i  +  j 
=  m. 


III.  Gaussianity  in  Quadratic  System 
Identification 


A  relatively  common  assumption  for  the  estimation  of 
the  linear  and  quadratic  transfer  functions  in  (9)  and  (10) 
is  that  the  in^ut  possesses  zero-mean  Gaussian  statistics, 
which  allows  terms  containing  the  third-order  moment 
(i.e.,  the  auto  bispectrum)  of  the  input  to  vanish.  Conse¬ 
quently.  the  estimations  of  the  transfer  functions  for  a 
zero-mean  Gaussian  excitation  are  independent  and  there 
is  no  coupling  between  them  because,  under  Gaussian  ex¬ 
citation,  the  first  two  terms  on  the  right-hand  side  of  the 
Volterra  equation  (7)  are  orthogonal.  Thus,  the  linear  and 
quadratic  transfer  functions  for  a  zero-mean  Gaussian  in¬ 
put  are  obtained  separately  as  follows  (6],  [7): 


Hi(m)  = 


SyAm) 

Sa(m) 


(II) 


Hzd,/) 


25^,(/)S„(y)’ 


i  +  j  *  0 


(12) 


where  the  autopower  spectrum  is  used  in  the  denominator 
of  (12)  since  fourth-order  spectral  moment  appearing  on 
the  right-hand  side  of  (10)  can  be  expressed  in  terms  of 
second-order  spectral  moments  when  the  input  is  Gauss¬ 
ian.  Thus,  we  recover  the  well-known  result  for  the  linear 
transfer  function,  namely,  it  is  equal  to  the  quotient  of  the 
crosspower  spectrum  divided  by  the  autopower  spectrum. 
The  expression  for  the  quadratic  transfer  function,  al¬ 
though  not  as  well  known,  is  somewhat  analogous  and  is 
given  by  the  quotient  of  the  cross  bispectnim  divided  by 
the  autopower  spectra  at  frequency  i  and  j.  The  factor  of 
two  appearing  in  the  denominator  of  (12)  arises  from  the 
symmetry  property,  //^(i.y)  =  //2(>,  0- 
The  assumption  of  Gaussian  excitation  made  in  (1 1)  and 
(12),  which  allows  a  subsuntiai  simplification  of  the  rel¬ 
evant  mathematics,  should  be  accepted  with  caution,  even 
though  the  marginal  distribution  function  of  the  excitation 
is  approximately  Gaussian.  The  Gaussianity  of  an  input 
signal  can  be  tested  several  ways.  One  of  these  is  a  mo¬ 
ment  test  of  the  univariate  marginals  of  the  data  as  de¬ 
scribed  by  Pearson  [14];  sample  coefficients  of  skewness 
and  kurtosis  are  used  to  reject  or  accept  the  Gaussianity 
hypothesis  at  a  certain  confidence  level.  The  Gaussianity 
of  the  signal  can  be  also  checked  by  goodness-of-fit  tests, 
such  as  the  Kolmogorov-Smimov  test  and  chi-square  test 
[15].  All  of  these  tests  focus  on  the  univariate  marginal 
distribution.  The  distribution  of  the  marginals  is  com¬ 
pared  to  a  Gaussian  univariate  distribution.  However,  as 
pointed  out  by  Brockett  et  al.  [16],  the  univariate  mar¬ 
ginal  distribution  of  the  time  series  may  satisfy  Gaussian¬ 
ity  tests  even  if  the  joint  distribution  for  several  time 
points  is  non-Gaussian.  For  this  reason,  we  use  the  so- 


103: 


IEEE  TRANSACTIONS  ON  SIGNAL  PROCESSING.  VOL  40.  NO  J.  MAY  1992 


called  Hinich  test  [lOJ  to  examine  Gaussianity  of  the  sig¬ 
nal.  In  Hinich’s  approach,  a  sample  bispectrum  is  used  to 
construct  a  statistic  to  test  whether  the  bispectrum  of  the 
signal  is  nonzero.  A  rejection  of  the  null  hypothesis  im¬ 
plies  a  rejection  of  the  hypothesis  that  the  signal  is  Gauss¬ 
ian.  More  specifically,  a  sample  bispectrum  of  an  input 
signal  {x(0),  jr(l),  •  •  •  .  x{N  -  1)}  is  first  defined  by 

F{i,j)  =  N-'Xii)Xij)X*{i  j).  (13) 

To  obtain  a  consistent  estimate  of  the  bispeclrum,  the 
sample  bispectrum  is  smoothed  over  a  square  of  points 
centered  at  ((2m  -  \)M/2.  (2n  -  l)Af/2)  in  the  two- 
dimensional  frequency  plane.  The  estimator  is 

mAf  -  1  nM-\ 

Bux(m,  n)  =  L  S  F{i,j).  (14) 

»  =  (m  1)W  y  « <ff  -  \  )M 

Since  the  asymptotic  distribution  of  each  estimator  in 
(14)  is  complex  normal,  the  distribution  of 

f  ^ 'll _  (15) 

tiV/Af']'''^[4(m)5„(n)S„(m  -E  n)]'/^ 

is  complex  normal  with  unit  variance.  Here,  5^^  is  the  es¬ 
timator  of  the  power  spectrum  of  jc(r).  Consequently, 
is  approximately  chi-square  with  two  degrees  of 
freedom.  Since  Bt„(m,  n)  is  zero  for  all  m  and  n  if  the 
process  is  Gaussian,  the  statistic 

r  =  2ZZ|L,„|'  (16) 

m  n 

is  asymptotically  distributed  x^(2^)  under  the  null  hy¬ 
pothesis  of  Gaussianity.  Here,  P  is  the  degree  of  freedom 
defined  in  a  principle  domain  [10].  For  large  N  and  thus 
large  P,  (2/*) ~ ‘  x ^ (2/*)  is  approximately  normal.  Finally, 
Gaussianity  can  be  determined  by  shifting  and  scaling  the 
statistic  to  standardize  and  compare  it  to  the  probability 
of  exceeding  its  value  under  a  Gaussian  zero-mean  unit- 
variance  distribution. 

This  sample  bispectrum  test  is  especially  meaningful 
when  we  estimate  linear  and  quadratic  transfer  functions 
of  a  second-order  Volterra  series  under  the  Gaussian  input 
assumption.  Since  the  process  of  estimating  the  linear  and 
quadratic  transfer  function  for  a  zero-mean  Gaussian  in¬ 
put  is  based  on  the  assumption  that  the  terms  containing 
the  third-order  moment  of  the  input  signal  in  (9)  and  (10) 
vanish,  the  direct  test  for  Gaussianity  of  the  input  would 
be  to  examine  whether  the  autobispectrum  of  the  input 
vanishes  or  not.  Note  that  moment  coefficient  of  skew¬ 
ness,  E[x^(n)]/F\x^{n)]^^^,  in  the  classical  Gaussianity 
test  examines  Gaussianity  of  x(n)  only  for  zero  lags  in  the 
third-order  moment.  If  the  autobispectrum  of  the  input 
vanishes,  the  linear  and  quadratic  transfer  functions  au 
obtained  separately  from  (9)  and  (10),  respectively  (i.e., 
orthogonal  model).  However,  this  test  does  not  check 
Gaussianity  of  the  fourth-order  spectral  moment  of  the  in¬ 
put  signal,  which  is  assumed  to  be  expressed  in  terms  of 
second-order  spectral  moments  when  the  input  signal  is 
Gaussian.  Thus,  the  Hinich  test  may  be  regarded  as  a  nec¬ 
essary  but  not  sufficient  test  for  Gaussianity  in  quadratic 


nonlinear  system  identification.  Gaussianity  tests  based  on 
the  trispectrum,  which  is  the  next  higher  order  spectrum 
after  bispectrum,  are  currently  being  investigated  [17]. 
For  cubic  nonlinear  system  identification  Gaussianity  of 
the  input  signal  should  be  tested  up  to  the  sixth-order  mo¬ 
ment. 


IV.  Nonlinear  Coherence  Analysis 

For  non-Gaussian  inputs,  one  has  to  solve  (9)  and  (10) 
simultaneously  with  respect  to  H|(m)  and  H2U,  j).  Re¬ 
cently,  Kim  and  Powers  [11]  developed  the  estimation 
technique  of  A/j  (m)  and  Hi  (i,j)  for  the  general  input  case 
by  expressing  (7)  in  vector  notation  and  solving  the  equa¬ 
tion  under  the  minimum  mean  square  criterion  as  follows: 

I'('”)niodei  =  H(m)X(m)  (17) 

where  the  boldface  letters  denote  a  vector  quantity,  and 


where  N  is  the  number  of  sampled  data  per  realization. 
Furthermore, 


,r^(m)  =  [X,(m),  Jr[(m)]  (20) 
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where  ^  denotes  the  transpose  of  a  matrix.  Then  the  op¬ 
timum  solution  for  H(m)  in  (17),  under  the  minimum 
mean-square-error  criterion,  is  given  by 

Him)  =  E[X*(m}X^im)r'  E[X*(m)Y{m)].  (22) 


The  linear  and  quadratic  transfer  functions  (m)  and 
HzU,  j)  estimated  by  (22)  can  be  used  to  determine  the 
second-order  harmonic  distortion  factor  at  frequency 
mA/by  (6)  and  the  second-order  factor  of  intermodulation 
distortion  Dmz  at  frequency  (i  -I-  j)Afby  (4).  In  the  sin¬ 
usoidal  input  approach,  the  harmonic  and  intermodulation 
distortions  over  the  band  of  frequencies  of  interest  are 
measured  by  repeating  the  same  experiment  at  various 
frequencies  and  by  changing  frequencies  of  oscillators  and 
filters,  which  is  a  tedious  and  time-consuming  job.  With 
broad-band  random  input  the  distortions  can  be  estimated 
simultaneously  over  the  whole  frequency  band  of  interest 
as  long  as  the  input  has  a  broad  (nonzero)  spectrum  over 
the  band  of  frequencies  of  interest  and  the  data  set  is  large 
enough  to  replace  the  expectation  operator  in  (22)  by  an 
ensemble  average  over  a  sufficient  number  of  sample 
spectra. 

Another  important  advantage  of  the  random  input  ap¬ 
proach  is  that  the  nonlinear  distortion  can  be  quantified  in 
terms  of  coherence  spectra,  which  is  the  principal  focus 
of  this  paper.  In  a  single-channel  linear  system,  it  is  well 
known  that  the  coherence  spectrum  which  is  a 

normalized  crosspower  spectrum,  is  defined  as 


7^(/) 


i£I|y*(/)y(/)]|^  = 
£I|X(/)l']£T|r(/)|-]  5„(/)S,,(/)' 


true  output  power.  For  example,  one  can  derive  a  discrete 
version  of  (23)  from  the  above  interpretation  as  follows: 

^  ^  |//,(m)p£T|X(m)p) 

£l|y(m)p] 

uuc  £11  y(m)p] 

S^(m)Syy(m) 

where  the  expression  for£fi(m)  in  (11)  is  utilized.  Thus, 
one  can  interpret  y^im)  as  the  fraction  of  output  power 
which  can  be  accounted  for  by  the  linear  model  and  1  - 
7^(m)  is  the  fraction  of  output  power  which  cannot  be 
accounted  for  by  the  linear  model. 

The  coherence  spectrum  can  be  generalized  using  the 
above  definition,  for  the  second-order  Volterra  system  of 
(7)  in  the  frequency  domain  [11],  [20].  By  using  the  es¬ 
timates  of  Hi  (m)  and  H2  (i,j)  computed  by  (22),  the  model 
output  power  spectrum  Sy)  ('”)mod«i  can  be  expressed  as 
follows: 

iSyy(m)^odel  ^  I  ^(^)l*  Imodel 

=  £[lyt(m)|']  -E  E\\YQ(m)\^] 

-E  2  Re  £[y£(m)  l^>(m)*] 


=  Slim)  -E  Sgim)  -E  SiQini)  (25) 

where  Yi(.m)  and  YQ(m)  denote  the  first  two  terms  on  the 
RHS  of  (7),  respectively,  and  where 

Sdm)  =  lHi(mfEllX(m)\-]  (26) 


SQ(m)  =  E 


S  7/2 (r.  m  -  i)X{i)X(m  — 

i 


127) 


Since  |5),t(/)j'  s  ,.(/),  the  coherence  spectrum 

is  bounded,  i.e.,  0  <  y'f/)  <  1.  Statistically  speaking, 
the  coherence  spectrum  is  a  measure  of  the  degree  of  lin¬ 
ear  correlation  (or  linear  relationship)  between  output  and 
input  as  a  function  of  frequency.  The  ordinary  concept  of 
the  coherence  function  is  that  X(f)  and  Y(f)  are  per¬ 
fectly  correlated  at  the  frequency  /when  7  ^  ( /  )  =  1 ,  and 
they  are  completely  uncorrelated  when  7’ (/I  =  0.  How¬ 
ever,  this  interpretation  is  valid  only  for  a  linear  system. 
The  term  y'(f)  defined  in  (23)  represents  only  the  linear 
relationship  between  the  input  X(  f)  at  frequency /and  the 
output  Y(  f)  at  the  same  given  frequency.  Bendat  and 
Piersol  [19]  have  pointed  out  that  a  reduction  of  coher¬ 
ence  may  be  due  to  a)  extraneous  noise  in  the  measure¬ 
ments,  b)  a  nonlinear  relationship  between  input  and  out¬ 
put,  and/or  c)  multiple  relations  to  other  quantities.  Thus, 
when  more  than  one  frequency  component  of  the  input 
couple  and  interact  in  a  nonlinear  system  to  produce  com¬ 
ponents  with  new  frequencies  in  the  output,  the  coherence 
spectrum  defined  in  (23)  does  not  represent  the  proper  re¬ 
lationship  between  the  input  and  output. 

A  useful  interpretation  of  the  ordinary  coherence  spec¬ 
trum,  which  serves  as  a  basis  to  generalize  the  concept, 
is  that  it  is  the  ratio  of  the  model  output  “power”  (in  the 
sense  of  mean-squared  value  per  unit  frequency)  to  the 


SLQ(m)  =  2  Re  //,(m)  S  H^{i,  m  -  i)ElX*(i) 


■  X*(m  -  i)X(m)]j.  (28) 

Slim)  is  the  linear  contribution  to  the  model  output  power 
at  frequency  mAf,  5g(/«)  is  the  quadratic  nonlinear  con¬ 
tribution  to  the  model  output  power,  and  5i.e(m)  is  the 
“interference  part”  of  the  model  output  power.  For  a  per¬ 
fect  Gaussian  input,  SiQ(m)  is  zero  because  the  autobi¬ 
spectrum  E\X*{i)X*(m  —  i)X{m)]  is  zero  and  there  is 
no  coupling  between  the  linear  and  quadratic  outputs  (i.e. , 
the  model  is  orthogonal). 

Dividing  both  sides  of  (25)  by  the  actual  output  power 
spectrum  Syy  (m),  we  can  obtain  the  generalized  coherence 
spectrum,  which  characterizes  the  “goodness  of  fit”  of 
the  model,  as  follows: 


£t|T{m)j’U,  El\Y(m)\^] 

=  ylifn)  +7q(w)  +  yloim) 

where 


ylim) 


Sdm) 

S„(m) 


(29) 

(30) 
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yQ(m)  = 


Spim) 

Svv(m) 


(31) 


where  the  terms  7i(m).  ygim),  and  ylpim)  are  the  linear 
coherence  spectrum,  quadratic  coherence  spectrum,  and 
coupled  coherence  function  of  the  second-order  Volterra 
model,  respectively.  yl(m)  and  ypim)  are  the  fractional 
contributions  to  the  output  power  at  frequency  mA f  due 
to  linear  transfer  function  H\  (m)  and  quadratic  transfer 
function  j),  respectively.  As  discussed  above, 

yioCnr)  is  nonzero  for  non-Gaussian  inputs,  and  denotes 
the  fraction  of  the  model  output  power  at  frequency  mA f 
due  to  the  “interference”  of  the  output  of  the  linear  and 
quadratic  transfer  functions.  The  terms  ylim)  and  ypim) 
are  positive,  but  not  bounded  by  unity  since  ylpim)  can 
be  positive  or  negative  depending  on  the  phase  relation¬ 
ship  between  the  linear  response  and  quadratic  response. 
However,  the  total  coherence  y^(m)  at  each  m  is  bounded 
by  zero  and  unity.  Possible  reasons  for  coherence  loss.  1 
-  7‘(m),  are  a)  the  neglect  of  higher  order  (third  or 
higher)  terms  in  the  model,  b)  additive  noise  in  the  mea¬ 
surements,  and  c)  extraneous  inputs. 

For  the  Gaussian  input  case  S^pim)  and,  hence, 
yipim)  are  zero  since  the  linear  and  quadratic  outputs 
Ylim)  and  Ygim)  are  orthogonal.  In  this  case  one  can  ob¬ 
tain  ylim)  and  ygim)  using  (1 1),  (12),  (26),  (27),  (30), 
and  (31)  as  follows: 


2  ,  I^vt(w)I~ 

S„im)S,.yim) 

2  _  1  2  |gv^,(/.  m  -  01^ 

2  /  5„(/)5„(m  -  i)S„im)' 


(33) 

(34) 


Notice  that  ylim)  in  (33)  is  identical  with  (24)  for  a  linear 
model,  and  ygim)  in  (34)  is  identical  with  the  coherence 
spectrum  derived  from  a  purely  quadratic  model  (7] .  That 
is,  the  generalized  system  coherence  spectra  defined  by 
(30)-(32)  with  (26)-(28)  reduces  to  the  ordinary  coher¬ 
ence  spectra  given  by  (33)  and  (34)  when  the  input  exci¬ 
tation  is  zero-mean  Gaussian. 


V.  Bichromatfc  Versus  Random  Process  Estimates 

In  the  previous  section,  the  random  input  approach  to 
estimate  second-order  harmonic  and  intermodulation  dis¬ 
tortion  of  the  component  under  test  is  described.  The 
measurement  of  nonlinear  distortions,  however,  has  tra¬ 
ditionally  been  conducted  by  supplying  sinusoidal  signals 
and  measuring  the  extent  to  which  harmonic  or  combi¬ 
nation  tones  are  generated  in  the  output  of  the  component 
under  test.  In  order  to  calculate  the  response  of  the  quad- 
ratically  nonlinear  system  subject  to  two  sinusoidal  inputs 
with  frequency /,  and /;  ( /,  >  /,),  let  the  input  be 
•> 

Mt)  =  S  \Vi  \  cos  {Ivfjt  -I-  Bj]  (35) 


which  can  also  be  expressed  as 

2 

xir)  =  5  S  K,  exp  lj2r/,r}  (36) 

j  =  -2 


where 


=  y*,  0,  /-,  = 

The  linear  response  to  the  inputs  specified  by  (35)  is  given 
by  (18] 

1 

y,it)  =  E  |K,,|  |//,(y:)|  cos  llrfit  +  0,  -h  <^,(/)]. 

i «  \ 


(37) 


The  quadratic  response  to  the  input  specified  by  (36)  is 
given  by 


>-2(0  =  E  y,„yi,,H2if,„f.,) 


I  *  -3 


■  exp  I72t(/,  -h/j)r].  (38) 

Since  Hii-fx,  -fi)  =  H^A^fi)  and  W2(/i-  /j)  = 
/2,/i )  in  real  systems,  (38)  can  be  expressed  as 

2 

>-2(0  =  E  \y,fmf„f,)\ 

t »  1 


•  cos  l2Ti2/i)i  +  2f  ^2  (/./)] 

+  2||//,||K;,||//2(/„/2)| 

•  cos  |2x-(/,  +  f2)r  -f  0|  -t-  02  +  <hif\^A)] 

+  2\y,My,,\\H2ifu  -A)\ 

•  cos  [2t(/|  -f2)t  -t-  0,  -  02  +  <^2(/|.  -/2)J 

2 

-I-  E  \y,fH2if„  -fi).  (39) 

I  *  \ 


The  first  term  in  (39)  denotes  second-order  harmonic  dis¬ 
tortions,  the  second  term  denotes  an  intermodulation  dis¬ 
tortion  at  the  sum  frequency,  the  third  term  denotes  an 
intermodulation  distortion  at  the  difference  frequency,  and 
the  last  term  denotes  the  DC  component.  Note  that  the 
second  and  third  terms  accounting  for  intermoduiation 
distortion  contain  the  factor  of  two  while  the  first  term 
accounting  for  harmonic  distonion  does  not.  Note  also 
that  the  output  at  the  difference  frequency,  f  -  f 2,  con¬ 
tains  not  only  the  intermoduiation  distortion  at  that  fre¬ 
quency  but  also  the  second  harmonic  distortion  due  to  fre¬ 
quency  /2  if  /i  -  /a  =  2/2.  These'  harmonic  and 
intermoduiation  distortions  (or  //2(/i-  -fi)  and  H^ifi. 
/a))  can  be  identified  by  supplying  only  one  sinusoidal 
signal  first  to  measure  harmonic  distortion  (or  //2(/2,/2 )) 
and  by  subtracting  the  harmonic  distortion  from  the  total 
distortion  subject  to  two  sinusoidal  signals  to  obtain  in¬ 
termoduiation  distortion.  One  can  also  see  that  the  output 
at  the  difference  frequency  f  -  /2  contains  the  inter¬ 
moduiation  distortion  at  that  frequency  as  well  as  linear 
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response  at  frequenc\  /;  if/,  -  /  =  Z-  The  intermodu- 
lation  distortion  (or  W;  i  /, ,  -/; ))  can  be  obtained  by  sup¬ 
plying  only  one  sinusoidal  signal  first  to  measure  linear 
response  y,  (?)  and  by  subtracting  y,  (?)  from  the  total  re¬ 
sponse  corresponding  to  the  two  sinusoidal  signals. 

In  order  to  compare  the  two-tone  input  method  with 
random  input  method,  a  known  system  is  used  in  a  sim¬ 
ulation  experiment  as  follows  [11]: 

y(?)  =  -0.7.r(?)  -  0.9jc(?  -  2)  +  0.8x^(?) 

-f-  0.5.t-i?  -  2).  (40) 


The  quadratic  transfer  function  of  the  above  system  is 
given  by 


fiiif.f,)  =  0.8  -  0.5  exp  [-y4ir(/  +  fj)].  (41) 

In  Fig.  1  we  present  an  amplitude  and  a  phase  of  H2(f, 
fj)  estimated  by  the  two-tone  input  method,  discussed  in 
this  section;  i.e.,  by  changing  frequencies  of  two  sinu¬ 
soidal  signals  and  measuring  the  reponse  at  the  sum  and 
difference  frequencies.  Note  that  it  is  enough  to  plot  only 
the  upper  and  lower  triangular  region  of  //:(/,/)  in  the 
/,  and  /  plane  because  of  various  symmetry  properties. 
Values  in  the  upper  triangular  region,  where /,  >  0,/2  > 
O./i  >  /.  represent  distortions  at  sum  frequencies,  while 
values  at  the  lower  triangular  region,  where  /,  >  0,/2  < 
0-  /i  -  I  /:  I  ■  represent  distortions  at  difference  frequen¬ 
cies.  The  values  located  on  the  45“  line  in  the  upper  tri¬ 
angular  region  correspond  to  harmonic  distortion  while 
the  others  correspond  to  intermodulation  distortions.  The 
valley  between  those  two  regions  exists  since  distonions 
due  to  a  sinusoidal  signal  interacting  with  DC  component 
are  not  calculated  and  assigned  a  value  of  zero. 

Fig.  2  shows  the  amplitude  and  phase  of  //jC/, /)  es¬ 
timated  by  the  random  input  method  described  in  Section 
IV.  For  this  simulation  the  output  data  are  generated  by 
applying  a  zero-mean  exponential  input  (IMSL  GGEXN) 
to  the  known  system  (40).  The  input  and  output  data  rec¬ 
ords  were  divided  up  into  2000  segments  of  64  samples 
each.  Each  data  segment  is  Fourier  transformed  via  the 
FFT  to  calculate  various  sample  spectra  in  (22)  and  aver¬ 
aged  to  obtain  the  spectral  estimates.  As  expected,  the 
quadratic  transfer  function,  accounting  for  second-order 
distortion,  estimated  by  the  two-tone  input  method  shows 
good  agreement  w  ith  the  one  estimated  by  the  random  in¬ 
put  method  except  for  small  fluctuations.  Once  the  trans¬ 
fer  functions  are  estimated,  distortion  factors  can  be  cal¬ 
culated  by  substituting  the  estimateo  transfer  functions 
into  (4)  and  (6).  For  example.  at  frequency  5Af(Af 
=  1/64  Hz)  is  -11.3  (-11.1)  dB,  D,m2  at  8A/  -  7A/ 
is  1.1  (1.7)  dB.  and  at  8A/-f-  7A/is  -10.9  (-10.5) 
dB  (the  values  in  parenthesis  are  estimated  by  the  random 
input  method  and  are  preceded  by  the  values  estimated  by 
the  two-tone  input  method).  Traditionally,  distortion  fac¬ 
tors  are  measured  by  (3)  and  (5)  with  sinusoidal  signals, 
which  give  the  same  results  obtained  by  (4)  and  (6)  with 
sinusoidal  signals. 


(a) 


(b) 

Fig.  1.  Amplitude  and  phase  of  quadratic  transfer  function  //jlf.  y)  (sec 
(41))  estimated  by  two-tone  input  method;  (a)  estimated  amplitude,  (b)  es¬ 
timated  phase. 


Fig.  2.  Amplitude  and  phase  quadratic  transfer  function  H,  (i.y)  estimated 
by  random  input  method:  (a)  estimated  amplitude,  (b)  estimated  phase. 
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VI.  Testing  for  Gaussianity  and  Experimental 
Results 

A.  Experimental  Setup 

Loudspeakers  tend  to  be  the  weakest  component  in  most 
high-fidelity  sound-reproducing  systems.  The  nonlinear 
distortion  of  the  loudspeaker  is  typically  a  maximum  at 
the  lower  frequencies.  The  principal  cause  for  nonlinear 
distortion  at  the  lower  frequencies  is  nonuniform  Bl  prod¬ 
uct  versus  the  excursion  of  the  voice  coil.  Also,  nonlin¬ 
earities  in  the  compliance  of  the  suspension  and  surround 
are  not  negligible.  The  prominent  nonlinearities  are:  a) 
the  Bl  product,  b)  the  self-inductance  of  the  voice  coil, 
and  c)  the  nonlinear  compliance  of  the  suspension  and 
surround  [21],  a)  and  b)  may  result  from  a  nonuniform 
flux  distribution. 

The  canonical  equation  for  an  antireciprocal  transducer 
at  low  frequencies  can  be  given  by  the  following  when  air 
loading  is  neglected: 


Bli  =  mJc  +  R„x  +  kx 

(42) 

V,  .  «,/  + 

*  dt 

(43) 

where  Bl  is  the  force  factor,  i  is  the  voice-coil  current,  x 
is  the  voice-coil  excursion,  and  m,  R„,  and  k  are  the  mov¬ 
ing  mass,  the  mechanical  damping,  and  the  stiffness  of 
the  mass-spring  system.  Vg  is  the  generator  voltage, 
and  Lf  are  the  electrical  resistance  and  inductance  of  the 
voice  coil. 

Kaizer  [21]  modeled  the  loudspeaker  in  the  low-fre¬ 
quency  band  by  a  Voiterra  series  expansion  after  approx¬ 
imating  Bl,  k,  and  L,  in  (42)  and  (43)  by  a  truncated  power 
series  as  follows: 


Bl  =  BIq  +  bix  -1-  bix^ 

(44) 

k  ~  liQ  +  kjX  -F  k2X^ 

(45) 

Le  =  LfQ  +  l\X  -F  liX^. 

(46) 

The  amount  of  distortion  will  depend  on  many  design  pa¬ 
rameters,  such  as  magnet  size,  voice-coil  design,  suspen¬ 
sion,  etc.  Also,  it  depends  on  the  input  power  level.  As 
pointed  out  by  Greiner  and  Sims  [22],  distortion  can  in¬ 
crease  dramatically,  for  input  power  above  a  few  watts. 
In  general,  minimizing  cone  travel  by  using  large-diam- 
5ter  high-efficiency  drivers  tends  to  reduce  distortion. 

To  demonstrate  she  feasibility  and  practicality  of  this 
aaper’s  proposed  approach,  the  second-order  Voiterra  se¬ 
ries  has  been  utilized  to  model,  identify,  and  quantify  the 
inear  response  and  the  second-order  distortion  product  of 
oudspeakers  in  the  low-frequency  region.  The  arrange- 
nent  of  Fig.  3  wts  used  to  measure  the  distortion  perfor- 
nance  of  the  compression  driver  typical  of  those  used  to 
Irive  acoustic  horns.  The  noise  source  used  puts  out  a  flat 
loise  spectrum  band  limited  to  500  Hz.  A  low-pass  filter 
vith  a  cutoff  frequency  of  420  Hz  was  inserted  before  the 
implifier  which  was  adjusted  to  drive  the  compression 
Iriver  at  10  W.  The  compression  driver  was  placed  in  an 


Fig.  5-  Block  diagTam  of  distonion  analysis  setup.  Digital  signal  proces¬ 
sor:  DSP56000.  Motorola;  Low-pass  filler:  Krohn-hite  mode  I50AR  (cut¬ 
off  frequency  =  420  Hi);  Probe  condenser  microphone  cartridge:  type  4 1 3J 
Band  K;  Microphone  power  supply;  type  2804,  Band  K. 

anechoic  duct  with  a  probe  (condenser  microphone  car¬ 
tridge).  The  microphone  power  supply  with  a  probe  and 
amplifier  used  in  this  experiment  is  assumed  linear  over 
the  band  of  interest  (10-500  Hz)  from  the  specifications 
provided  by  the  manufacturer.  A  digital  signal  processor 
(DSP  56000)  took  98  304  data  samples  from  the  input  and 
output  signal  at  a  1-kHz  sampling  rate.  These  were  di¬ 
vided  up  into  384  segments  of  128  samples  each.  In  order 
to  compute  various  sample  spectra  appearing  in  (22),  each 
data  segment  is  Fourier  transformed  via  the  FFT,  and  then 
the  384  sample  spectra  were  averaged  to  get  the  spectral 
estimates.  The  frequency  resolution  A/ is,  therefore,  500 
Hz  -F  64  samples  =  7.8125  Hz. 

B.  Testing  for  Gaussianity 

In  this  section,  we  test  the  noise  generator  for  Gauss¬ 
ianity.  First,  we  examine  the  Gaussian  hypothesis  by  per¬ 
forming  two  types  of  tests.  The  first  of  these  is  the  mo¬ 
ment  test  of  univariate  marginals  as  described  by  Pearson 
[14].  On  the  basis  of  5000  samples  from  the  noise  gen¬ 
erator  waveform,  the  following  values  are  found: 

skewness  =  -0.0123;  kurtosis  162  =  3.11934. 

According  to  the  table  provided  by  Pearson  the  hypothesis 
should  be  rejected  at  0.05  level  of  significance  if  lies 
outside  the  range  -0.057  to  0.057,  and  at  0.01  level  of 
significance  if  lies  outside  the  range  -0.08!  to  0.08 1 . 
The  observed  value  lies  inside  the  limit.  Also,  the  hy¬ 
pothesis  should  be  rejected  at  0.05  level  of  significance  if 
P2  lies  outside  the  range  2.89  to  3.12,  and  at  0.01  level 
of  significance  if  /Jy  lies  outside  the  range  2.85  to  3.17. 
The  observed  value  lies  inside  the  limit.  Thus,  the  Gauss¬ 
ian  hypothesis  is  not  rejected  on  the  basis  of  this  test. 

The  histogram  of  the  noise  generator  estimated  from 
the  same  data  set  (5000  samples)  is  shown  in  Fig.  4.  One 
may  assume  that  the  signal  from  the  noise  generator  is 
nearly  Gaussian  from  the  reasonably  bell-shaped  histo¬ 
gram  with  a  skewness  of  -0.0123.  Thus,  when  one  deals 
with  a  linear  system  or  univariate  marginal  distribution  of 
a  signal,  the  noise  generator  output  can  often  be  used  in 
experiments  as  a  Gaussian  source.  However,  as  was 


CHO  fi  a!.:  DIGITAL  TECHNIQUE  TO  ESTIMATE  SECOND  ORDER  DISTORTION 


(037 


Fig.  4.  Histogram  of  the  signal  from  the  noise  generator  shown  in  Fig.  3 
(variance  =  0.026.  skewness  =  -0.0123.  kurtosis  =  3.119). 

pointed  out  by  Brockett  f't  al.  [16],  the  univariate  mar¬ 
ginal  distribution  of  the  time  series  may  satisfy  Gaussian- 
ity  tests  even  if  the  joint  distribution  of  several  time  points 
is  non-Gaussian.  Thus,  if  one  wants  to  analyze  the  non¬ 
linearity  of  the  system  by  applying  the  noise  generator  to 
a  nonlinear  system  as  a  Gaussian  input  signal,  the  Gauss- 
ianity  statistics  should  be  carefully  investigated  before¬ 
hand,  since  the  univariate  marginal  distribution  of  the  in¬ 
put  signal  may  pass  classica'  statistical  tests  for 
Gaussianity  without  the  signal  being  sufficiently  Gaussian 
for  nonlinear  system  identification  purposes.  If  the  input 
signal  is  actually  non-Gaussian,  the  Gaussian  assumption 
may  lead  to  incorrect  estimates  of  the  Volterra  kernels, 
since  the  higher  order  spectra  used  to  estimate  the  kernels 
3  2  sensitive  to  depanures  from  Gaussianity. 

Since  we  want  to  estimate  the  linear  and  quadratic 
transfer  function  of  the  loudspeaker  by  applying  a  noise 
generator  as  an  input  source,  the  Hinich  test  is  used  to 
examine  the  Gaussianity  of  the  input  signal.  To  become 
familiar  with  Hinich’s  Gaussianity  statistic  value,  first  we 
test  the  Gaussianity  of  two  known  signals:  the  first  is  a 
zero-mean  Gaussian  generated  by  IMSL  GGNPM,  and  the 
second  is  a  zero-mean  exponential  input  generated  by 
IMSL  GGEXN.  The  Gaussianity  statistic  value  (standard¬ 
ized  value  of  the  statistic  T  in  (16))  for  the  Gau:.sian  signal 
is  -1.36,  which  is  not  large  enough  to  reject  Gaussianity 
[16].  The  Gaussianity  statistic  value  for  the  exponential 
signal  is  75.55,  which  is  large  enough  to  reject  the  Gauss¬ 
ianity  hypothesis,  as  we  expected. 

Next,  the  Gaussianity  of  the  signal  from  the  noise  gen¬ 
erator  is  tested.  The  resulting  statistic  value  by  the  Hinich 
test  is  102.10  (49  152  samples).  All  Gaussianity  statistic 
values  for  the  noise  generator  data  records  generated  un¬ 
der  different  conditions  (different  sample  points,  different 
power,  different  cutoff  frequency)  were  large  enough  to 
reject  the  Gaussianity  hypothesis.  Thus,  one  can  see  that 
the  signal  from  the  noise  generator  cannot  be  used  as  a 
Gaussian  source  for  quadratic  system  analysis.  It  can, 
however,  be  used  to  identify  the  quadratic  system  if  we 
utilize  the  “general  input"  approach  described  in  Section 
IV. 

C.  Experimental  Results 

Since  the  input  signal  from  the  noise  generator  to  the 
nonlinear  loudspeaker  system  is  not  sufficiently  Gaussian, 


(a) 


(b) 

Fig.  5.  Amplitude  of  the  loudspeaker  quadratic  transfer  function  Wi(i,  j) 
experimentally  estimated  using;  (a)  two-tone  input  method,  (b)  random  in¬ 
put  method. 

one  has  to  solve  (9)  and  (10)  simultaneously  for  H,  (m) 
and  //2(/, y).  The  linear  and  quadratic  transfer  functions 
are  estimated  according  the  solution  given  by  (22).  The 
estimated  Hid.  j)  (smoothed  over  adjacent  frequencies), 
accounting  for  second-order  distortion,  is  shown  in  Fig. 
5(b).  To  compare  the  proposed  technique  with  the  con¬ 
ventional  analog  two-tone  method.  Hi  (i.  j)  is  measured 
by  the  method  described  in  Section  V.  Two  oscillators, 
both  of  them  having  the  same  amplitude,  are  applied  to 
the  amplifier.  The  output  of  the  amplifier,  which  gener¬ 
ates  a  sinusoidal  signal  at  10  W,  is  connected  to  the  com¬ 
ponent  under  test  to  calculate  the  distortions  at  sum  fre¬ 
quencies  and  difference  frequencies  by  (39).  Hiii.j).  in 
fig.  5(a),  measured  by  the  two-tone  input  method,  shows 
good  agreement  with  the  one  obtained  using  the  random 
input  approach  except  for  relatively  small  fluctuations. 
Note  the  large  values  along  the  diagonal  line  ir  the  dif¬ 
ference  interaction  zone,  which  suggests  that  difference 
frequency  generation  dominates  the  distortion  process. 

Distortion  factors  of  the  loudspeaker  can  be  calculated 
by  substituting  the  estimate,;  transfer  functions  into  (4) 
and  (6),  Fig.  6  shows  the  results  for  at/,  -F  /i  for  a 
fixed  frequency  fi(=  lA/)  as  a  function  of  frequency  /). 
Reasonable  agreement  between  the  two-lone  input  method 
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Frequency  {Hx] 

(b) 

Fig.  8.  Actual  and  model  output  power  spectrum  of  loudspeaker  with  noise 
generator  input:  (a)  linear  part  of  model  output  power  spectrum  Si,  quad¬ 
ratic  part  of  model  output  power  spectrum  Sq,  and  interference  part  (ab¬ 
solute  value)  of  output  power  model  spectrum  SlQ,  (b)  measured  rmiput 
power  spectrum  S,„.  total  model  output  power  spectrum  =  Si  +  Sq 
+  ^LQ,  and  linear  part  of  model  output  power  spectrum  Si. 


Fig.  9.  System  coherence  spectra  and  output  power  spectra  of  loud¬ 
speaker:  (a)  system  coherence  spectra  for  loudspeaker  Voherra  models  cal¬ 
culated  using  general-input  method  and  assumed-Caussian  input  method, 
(b)  output  power  spectrum  of  loudspeaker:  actual  (observed)  spectrum  and 
output  power  spectra  calculated  from  Volterra  models  determined  using  the 
general-input  method  and  assumed-Gaussian  input  method. 


Gaussian  excitation  case.  The  system  coherence  calcu¬ 
lated  for  the  assumed  Gaussian  input  method  is  plotted  in 
Fig.  9(a)  and  we  note  that  except  at  low  frequencies  it  is 
considerably  greater  than  unity.  In  fact,  this  plot  indicates 
that  the  Volterra  model  calculated  using  (11)  and  (12) 
overestimates  the  response  of  the  loudspeaker  by  up  to  a 
factor  of  3.  On  the  other  hand,  as  shown  in  Fig.  9(a),  the 
Volterra  model  calculated  on  the  basis  of  (22)  and  valid 
for  general  inputs  (non-Gaussian  as  well  as  Gaussian) 
yields  a  system  coherence  which  is  close  but  somewhat 
less  than  unity  over  the  entire  frequency  band.  In  Fig. 
9(b)  we  plot  the  actual  power  spectrum  of  the  loudspeaker 
response  when  driven  by  the  noise  source,  and  the  re¬ 
sponses  “predicted”  by  the  Volterra  models  estimated 
using  the  assumed  Gaussian  input  method  and  the  general 
input  method.  Again  we  see  that  the  general  input  method 
“predicts”  a  response  which  is  in  much  closer  agreement 
with  that  which  is  actually  observed.  On  the  other  hand, 
the  assumed  Gaussian  input  method  overpredicts  the  re¬ 
sponse  over  the  vast  majority  of  the  frequency  spectrum. 
In  summary.  Fig.  9  graphically  illustrates,  by  way  of  ex¬ 
ample,  the  deleterious  effects  of  assuming  Gaussian  ex¬ 
citation  when,  in  fact,  it  is  not. 


VII.  Summary 

In  this  paper,  a  digital  higher  order  spectral  method  to 
evaluate  the  second-order  distortion  of  a  nonlinear  sys¬ 
tem,  which  can  be  represented  by  Volterra  kernels  up  to 
second  order  and  which  is  subjected  to  a  random  input, 
was  described.  The  common  assumption  that  the  input 
possesses  Gaussian  statistics  was  investigated.  The  pre¬ 
liminary  results  indicate  that  one  must  carefully  check  the 
degree  of  Gaussianity  of  random  signal  generators  used 
in  experimental  nonlinear  system  identification  studies. 
Specifically,  we  have  shown  that  it  is  not  sufficient  to 
demonstrate  that  the  marginal  distribution  is  approxi¬ 
mately  Gaussian.  In  view  of  the  fundamental  role  the 
bispectrum  plays  in  the  system  identification  of  second- 
order  system,  the  Hinich  test,  which  makes  use  of  the 
statistical  properties  of  the  bispectrum,  appears  to  be  par¬ 
ticularly  appropriate. 

After  checking  the  Gaussianity  of  the  noise  generator 
and  demonstrating  that  it  is  not  sufficiently  Gaussian,  the 
general  input  method  was  utilized  to  estimate  the  Volterra 
linear  and  quadratic  transfer  function  of  a  nonlinear  sys¬ 
tem.  Then,  the  quadratic  transfer  function  accounting  for 
second-order  distortion  was  estimated  by  applying  (bi- 
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chromatic)  sinusoidal  signals  to  the  system  to  compare  the 
random  input  method  with  the  classical  two-tone  input 
method.  Relatively  good  agreement  was  obtained  for  both 
simulation  and  experiment.  Finally,  the  low-frequency 
loudspeaker  distortion  is  quantified  in  terms  of  nonlinear 
coherence  spectra  and  the  relative  nonlinear  contribution 
to  the  output  power. 

In  the  future,  the  random-input  method  of  this  paper 
will  be  extended  to  handle  cubic  distortion.  Furthermore, 
the  technique  might  ultimately  be  used  to  construct  a  non¬ 
linear  inversion  circuit  for  the  purposes  of  compensating 
distortion.  Because  of  the  generality  of  the  Volterra  ap¬ 
proach,  the  approach  of  this  paper  can  be  utilized  to  quan¬ 
tify  distortion  in  other  components  of  an  audio  system. 
Moreover,  it  can  be  applied  to  measure  distortion  intro¬ 
duced  by  any  type  of  system,  including  all  types  of  com¬ 
munication  systems. 
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A  Digital  Method  of  Modeling  Two-Input  Quadratic 
Systems  with  General  Random  Inputs 

C  K.  An,  E  J.  Powers,  and  C  P.  Ritz 

Ahstraet—K  digital  metliod  of  estlmaling  linear  and  quadratic  trans¬ 
fer  functions  of  a  two-input/multiple-output  qoadraticaliy  nonlinear 
system  with  gencrai  (i.e.,  non-Gaussian  as  well  as  Gaussian)  random 
inputs  is  described.  The  approach  is  based  un  a  frequcucf  Uw.nain  sec¬ 
ond-order  Votterra  functional  scries  representation  for  a  iwo-inpul/ 
muitipie-output  system. 

I  Ntroductio.v 

It  is  the  objective  of  this  correspondence  to  present  a  digital 
method  of  estimating,  from  the  raw  input  and  output  time  series 
data,  the  linear  and  quadratic  transfer  functions  necessary  to  char¬ 
acterize  a  quadratically  nonlinear  system  with  two  inputs  and  mul¬ 
tiple  outputs  The  transfer  functions  are  expressed  in  terms  of  vanous 
polyspectra  (power  spectra,  bispectra,  and  fourth-order  spectra) 
which,  in  turn,  are  digitally  estimated  from  the  raw  input  and  out¬ 
put  time  series  data  Since  the  approach  is  valid  for  general  (e  g  . 
non-Gaussian)  inputs.  Gaussian  inputs  are  included  as  a  special 
case.  Our  approach  is  based  on  a  frequency  domain  second-order 
Volterra  functional  senes  representation  (1)  for  a  iwo-input/mul- 
tiple-output  system.  The  nonlinear  transfer  functions,  i.e  ,  the 
Fourier  transforms  of  the  Volterra  kernels,  measure  (he  strength  of 
the  nonlinear  coupling  between  various  frequency  components  pre¬ 
sent  in  the  input  and  output  signals,  and  thus  enable  one  to  quantify 
the  rate  at  which  energy  is  exchanged  between  various  modes  of 
the  nonlinear  system. 

Since  the  values  of  the  higher  order  spectral  moments  used  in 
determining  the  Volterra  transfer  functions  are  very  sensitive  to 
departures  from  Gaussianity.  it  is  important  to  take  such  departures 
into  account.  Such  departures  may  arise  because  of  the  past  non- 
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linear  history  of  the  signal  (e  g  .  as  occurs  in  transition  to  turbu¬ 
lence  {2)).  or  if  the  source  of  the  exciting  signal  is  noi  sufficienily 
Gaussian  [3] 

Since  the  approach  closely  follows  that  onginally  developed  for 
single-input  quadratically  nonlinear  syiicms.  the  reader  is  referred 
to  (4|  for  specihc  deuils  Also  see  (5) 

II  Estimation  o;  T"’.-  ;rrr  Flnctiov' 

In  this  correspondence  we  assume  that  the  unknown  nonittsear 
system  is  of  second  order,  thus  higher  order  terms  may  be  ne¬ 
glected  Since  our  approach  is  based  on  a  frequency  domain  sec¬ 
ond-order  Volterra  functional  senes  representation  (with  the  dc  term 
subtracted),  we  will  start  from  the  following  input-output  telatioo- 
ship  in  the  discrete  frequency  domain: 

flm)  «  Li(m)X,(m)  +  Lj(m)X:(m) 


M  4 

-v  Y.  m  - 

■  »  » 

m  4 

♦  Y  2{3j2(<.  m  - 

N.  < 

+  L'  0i;(l,  m  -  1 

i)X,(i  )X}lm  - 

+  Y  Qitii.  m  -  1 
>  •  $ 

i)X;(i)X,(m  - 

where  s  =  Im  +  l)/2  and  m  is  odd.  Form  even,  similar  expres¬ 
sions  may  be  obtained  In  Fig.  I  is  shown  the  two-dimensional 
frequency  plane  over  which  the  vanous  quadratically  nonlinear 
transfer  functions  are  defined.  Because  of  symmetry  considerations 
It  is  necessary  to  determine  the  quadratic  transfer  functions  in  re¬ 
gions  5,  and  D,  only.  Note  that  in  order  to  avoid  aliasing  in  the 
output  of  a  quadratically  nonlinear  system  we  have  assumed  the 
input  IS  band  limited  to  /„  *.  where  denotes  the  sampling  fre¬ 
quency  of  both  the  input  and  output.  In  (I)  it  is  assumed  that  the 
number  of  data  points  in  one  segment  is  M.  thus  the  upper  limit  in 
the  summations  is  M/4,  which,  of  course,  corresponds  to/.,/,. 
Note,  that  a  schematic  interpretation  of  (I)  is  provided  in  Fig.  2. 

We  may  rewrite  ( 1 )  using  vector  notation  as  follows 

Y{m)  =  *'{m)fc(m)  (2) 

where  t  denotes  transpose,  and  where  the  elements  of  x(m)  and 
fc(m)  are  similar,  although  generalized  for  the  two-input  case,  to 
those  of  14], 

Next,  we  solve  for  h(m)  in  such  a  way  as  to  minimize  the  mean- 
square  error  between  the  model  output  Ylm)  and  the  actual  ob¬ 
served  output  K(m).  The  solution  is  determined  by  solving  the  fol¬ 
lowing  matrix  equation  for  h(m)  (4|, 

£{4r*(/n)K(m)}  =  E{x*(m)*'(m)}A(m)  (3) 
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Fig  I  Two-dimensional  frequency  dotnaimn  which  qusdraiically  nonlin¬ 
ear  transfer  functions  are  defined. 


Fig.  2.  Quadratically  nonlinear  system  model  with  two  inputs 


where  x*(m)  is  the  complex  conjugate  of  x{m).  Equation  (3)  is  a 
linear  matrix  equation  in  terms  of  the  transfer  function  vector  fi(m), 
consequently  the  solution  for  Aim)  is  given  by 

A(m)  =  [£{x*(m)x'(m)}]~  ‘E{x*(m)y(m)}  (4) 

if  £{x*(m)-r'(m)}  in  (4)  is  not  singular. 

The  LHS  of  (3)  consists  of  cross-power  spectra  and  cross  bi¬ 
spectra  between  the  two  inputs  and  the  output;  and  the  matrix 
E{x*(m)x’(m)}  in  (3)  consists  of  autopowcr  spectra,  cross-power 
spectra,  autobispectra,  cross-bi spectra,  autofourth-order  spectra  and 
cross-fourth-order  spectra  of  the  two  inputs.  Since  the  bispcctra 
and  fourth-order  spectra  are  multidimensional  functions  of  fre¬ 
quency,  the  estimation  is  computationally  very  expensive.  When 
the  inputs  are  Gaussian,  considerable  simplification  results  in  that 
the  autobispectra  are  zero,  and  the  fourth-order  spectral  moments 
can  be  expressed  in  terms  of  sums  of  products  of  second-order 
spectra  (i.e.,  power  spectra).  For  instance,  when  W  =  64  and  m  = 
2.  for  each  realization  only  162  matrix  elements  must  be  calculated 
under  the  zero-mean  Gaussian  assumption.  However,  the  number 
of  elements  we  must  estimate  for  the  non-Gaussian  case  is  2145 
for  each  realization.  Since  2016  elements  of  the  2145  elements  are 
fourth-order  spectra,  almost  all  of  the  computational  effort  is  spent 
to  estimate  the  fourth-order  spectra  in  the  matrix  E(x*(m)x'(m)}  in 
(3)  when  the  inputs  arc  not  zero-mean  Gaussian. 

HI.  Analysis  of  Simulation  Data 
A.  Known  Quadratic  System 

The  general  input  method  described  in  the  previous  section  has 
been  applied  to  simulation  data  in  order  to  demonstrate  the  feasi¬ 


bility  and  conectness  of  the  approach  The  diurete-iinne  syssem 
used  in  the  simulation  is  given  as  follows 

>fn)  =  -0  64i:(n>  ■<-  i,(n  ~  2)  -  0  64.r;U!)  -  xdn  -  2l 

*■  0  y-r;(ni  -t-  0,5.«;(n  -  li  +  O  qtjjit)  *  0.5x;(rt  -  2) 

^  1  IXiinLijtnl  0  fijCilnUjtri  -  1) 


v  0  3»,(/i  -  IU;(n)  +  1  3r 

,(n  ~ 

IU.,(/i 

-  1) 

(5) 

Therefore 

the  linear  and  quadratic  transfer 

fuiKiions  are 

given  by 

Uf) 

=  -0.64  -r  e 

(6) 

L,if) 

*  -0  64  -  <■  •*’' 

(7| 

=  09-1-  0.5e''**’^'^' 

(8) 

0  9  -I-  0.5e 

(9) 

=  1  2  -t-  0  3e  -v  0.6<- 

-V  1  5e 

'  /.  • 

(10) 

=  12  -V  0.6f  +  0  3e 

+  1.5e 

/.  •* 

(11) 

For  a  quantitative  measure  of  the  quality  of  the  estimates,  the 
percent  root  mean  square  errors  (%RMSE|  of  the  estimated  linear 
and  quadratic  transfer  functions  are  dehned  as  follows: 


%RMSE,  = 


jjL  V  ~ 


X  100|%1 


(12) 


9(RMSE, 


/l  V  V  -  f?fr->)i‘ 

•  -  !<?(».  2  )!•' 


X  10OI%] 


(13) 


where '  signifies  an  estimated  quantity,  the  absence  of  a  hat  denotes 
a  ’true  ’  value  and  3/,  is  the  number  of  frequency  points  over  which 
the  linear  transfer  function  is  calculated  and  A',  is  the  number  of 
frequency  points  in  the  regions  5|  and  D,.  Note  that  in  each  case 
the  error  is  averaged  over  all  relevant  frequencies. 

The  approach  developed  in  the  previous  section  has  been  tested 
by  utilizing  exponentudly  distributed  white  input  signals  x,(n)  and 
jri(n).  which  are  partially  correlated  with  each  other.  These  signals 
are  low-pass  filtered  with  a  cutoff  frequency  of  a  quarter  {/  =  0.25) 
of  the  sampling  frequency  of  the  input  signals.  Generated  via  (5) 
are  320  000  sample  point  data  records  for  both  the  input  and  output 
signals.  They  are  divided  into  5000  segments  of  64  data  points 
each.  The  various  higher  order  spectral  moments  needed  in  (4)  are 
computed  by  ensemble  averaging  over  the  appropriate  sample  spec¬ 
tra.  which  are.  in  turn,  estimated  from  discrete  Fourier  transforms 
of  each  segment.  First,  to  illustrate  deleterious  effects  of  assuming 
Gaussian  excitation,  when  it  is  in  fact  not.  we  estimate  all  the  linear 
and  quadratic  transfer  functions  assuming  Gaussian  excitation  and 
compute  the  %RMSE.  Next  we  estimate  the  transfer  functions, 
using  the  more  general  method  of  this  paper. 

When  we  use  the  assumed  Gaussian  input  method,  the  average 
%RMSE  of  the  two  linear  transfer  functions  is  40.3%  and  that  of 
the  four  quadratic  transfer  functions  is  43.4%.  If  we  use  the  general 
input  method  for  the  same  exponential  input  signals,  the  average 
%RMSE  of  the  linear  transfer  functions  drops  to  3.4%  and  that  of 
the  quadratic  transfer  functions  to  4.6%.  Thus,  the  average 
%RMSE’s  of  the  transfer  functions  calculated  assuming  Gaussian 
inputs,  when  in  fact  they  are  not  Gaussian,  are  approximately  10 
times  greater  than  those  of  the  transfer  functions  estimated  by  the 
general  input  method. 

With  vectorization,  the  computation  time  of  the  general  input 
method  is  about  34  seconds  while  computing  on  a  CRAY  X-MP/ 
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24  supercomputer.  This  fact  suggests  that  the  general  input  method 
developed  in  this  correspondence  is  highly  vectorizablc. 

B.  Output  Power  and  Coherence  Spectra 

To  further  test  the  validity  of  the  estimated  transfer  functions, 
the  mode!  output  power  spectrum  is  compared  with  the  actual  out¬ 
put  power  spectrum,  the  latter  being  computed  from  y(n)  in  (5), 
For  this  purpose,  new  exponentially  distributed  input  data,  which 
were  not  used  to  estimate  the  model  transfer  functions,  are  applied 
in  order  to  construct  the  model  output.  The  actual  and  model  output 
power  spectra  are  shown  in  Fig.  3(a).  The  system  coherence  spec¬ 
trum  Tsysieml"*)  is  defined  by 

where  ^,{m)  is  the  model  output  power  spectrum  and  is  the 
true  output  power  spectrum  for  frequency  m.  Since  the  system  co¬ 
herence  spectrum  is  the  fraction  of  the  actually  observed  output 
power  spectrum  which  can  be  accounted  for  by  the  model,  values 
near  unity  indicate  a  good  model  and.  hence,  good  transfer  function 
estimates.  As  shown  in  Fig.  3(b),  the  system  coherence  spectrum 
up  to  /  =  0.25  is  near  unity,  suggesting  the  estimated  linear  and 
quadratic  transfer  functions  are  good  models  of  the  two-input  quad- 
ratically  nonlinear  system  over  this  frequency  range. 

In  order  to  quantify  the  relative  “importance"  of  the  various 
linear  and  nonlinear  transfer  functions  appeanng  in  the  model,  we 
now  proceed  to  express  the  overall  system  coherence  in  terms  of 
its  component  partial  coherence  spectra.  Quantitatively,  we  can 
represent  the  model  output  K(m)  at  frequency  m  as  follows: 

K(m)  =  n  -  Yi,  +  Yq,,  +  Yg.,  +  Yq,,  (15) 

where  fjj,  Yq„.  and  Yq...  represent  the  model  outputs  of  L,. 
L2.  Q}i.  and  Q22  in  Fig  2,  respectively.  The  term  takes  into 
account  the  outputs  of  both  Q,2  and  Qi  ,,  since  Q,2  and  O21  actually 
represent  a  single  quadratic  transfer  function  for  which  the  inputs 
are  X,  and  Xj. 

The  model  output  power  spectrum  P,(m)  is  obtained  by  multi¬ 
plying  (15)  by  Y*(m),  then  taking  an  expected  value.  Next  we  di¬ 
vide  the  equation  for  F,(m)  by  the  actual  (i.e.,  “true")  output  power 
spectrum,  Py(m)  =  ElY*(rn)Y(m)].  and  we  observe  that  the  LHS 
becomes  the  system  coherency  spectrum  7sy5«n,(m)  and  the  terms 
on  the  RHS  represent  the  fractional  contributions  (i.e.,  partial  co¬ 
herence  spectra)  of  each  transfer  function  to  the  model  output  power 
spectrum  at  frequency  m,  i.e.. 

Tsysteraf"*)  =  >1.1  +  72.2  7ll  11  +  722.22  7l2.l2  (16) 

+  71.2  (17) 

+  7l.ll  +  7l  22  ■*-  7i.12  +  72.11  +  72.22  +  72.12  (18) 

+  711.22  +  7n.i:  +  722.12-  (19) 

For  example,  71  |  represents  in  fraction  form,  the  contribution  to 
the  total  output  power  at  frequency  m  of  the  linear  L|  transfer  func¬ 
tion  alone.  Similarly.  72;,;:  represents  the  fractional  contribution 
to  observed  output  power  at  a  frequency  m  of  the  quadratic  Q22 
transfer  function  alone,  and  so  on. 

Note  that  contributions  of  those  terms  given  in  (16)  are  intrin¬ 
sically  positive.  On  the  other  hand,  the  contributions  to  the  model 
output  power  represented  by  the  terms  in  (17)-(I9)  result  from  the 
interaction  of  the  complex  outputs  of  different  transfer  function 
models.  If  the  input  is  exactly  Gaussian  all  six  terms  in  (18)  will 
be  zero,  since  they  involve  odd  order  spectral  moments.  It  is  par- 
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Fig.  3  (al  Comparison  of  actual  and  model  output  power  spectra  of  non¬ 
zero-mean  exponentially  distributed  inputs  by  general  input  method,  (b> 
system  coherence  spectrum 
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Fig.  4.  Partial  system  coherence  spectra  estimated  by  using  general  input 
method  for  zero-mean  exponentially  distributed  inputs 

licularly  important  to  note  that  the  terms  in  (17)-(19)  may  be  neg¬ 
ative.  since  they  correspond  to  the  real  part  of  a  complex  quantity. 

In  Fig.  4  we  have  plotted  the  dominant  seven  of  the  15  partial 
coherence  spectra  for  the  simulation  experiment.  Note  that  those 
terms  dependent  upon  a  linear  transfer  function  exist  only  in  the 
region  0  <  /  <  0.25,  because  of  the  antialiasing  filters  applied  to 
the  inputs.  Observe,  in  accordance  with  our  previous  discussions, 
that  all  the  terms  appearing  in  (16)  are  positive,  while  7,  n  and 
72,22  are  both  positive  and  negative.  Because  of  the  nature  of  this 
simulation  example,  all  partial  coherence  spectra  shown  in  Fig.  4 
are  of  comparable  maximum  amplitude,  but  peak  up  at  different 
frequencies.  Although  of  no  physical  significance  in  this  simulation 
example,  such  plots  are  capable  of  potentially  providing  significant 
insight  into  the  nonlinear  transfer  of  energy  from  various  spectral 
bands  in  the  two  inputs  to  other  spectral  bands  in  the  output  of  a 
quadratically  nonlinear  system. 

IV.  Conclusion 

There  are  a  number  of  practical  applications  where  it  is  important 
to  have  the  capability  to  model  dual-input  nonlinear  systems,  some 
examples  of  which  follow:  ground  vibration  tests  of  aircraft  where 
two  random  exciting  forces  are  applied  in  order  to  inject  sufficient 
energy  into  all  modes  of  the  system;  and  experimental  studies  of 
transition  to  turbulence  in  fluids  and  plasmas,  where  the  two  inputs 
consist  of  two  different  velocity  components,  and  plasma  density 
and  potential  fluctuations,  respectively.  The  approach  described  in 


IEEE  TRANSACTIONS  ON  SIGNAL  PROCESSING.  VOL  »  NO  10,  OCTOBER  IWt 


this  correspondence  is  currently  being  utilized  to  investiggie  such 
applications. 
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Bispectral  magnitude  and  phase  recovery  using  a 
wide  bandwidth  acousto-optic  processor 


Stacy  K.  Kniffen,  Michael  F.  Becker,  and  Edward  J.  Powers 


A  hybrid  optical-digital  processor  has  been  developed  that  computes  both  the  magnitude  and  phase  of  the 
biapectrum  for  wide  bandwidth  ( 10  MHi  to  1  GHz)  rf  signals  The  overall  optical  architecture  is  that  of  a 
modified  Mach-Zehnder  interferomeur  that  contains  three  aoousto-optic  modulators  and  appropriate 
transforming  lenses  The  intensity  distribution  in  the  output  plane  of  the  interferometer  contains  an 
interference  term  that  represents  the  real  part  of  the  bispectrum  multiplied  by  a  apatial  earner  ‘the 
interference  fringes)  To  isolate  the  bispectrum  information,  the  output  image  u  digitized  and  digital!) 
filtered  The  imaginary  part  of  the  bispectrum  is  obtained  by  Hilbert  transforming  the  real  part,  and  then 
computing  the  bispectrum  magnitude  and  phaae.  The  procetaor  is  tested  with  four  different  combinations 
of  rf  test  signals  Each  signal  has  a  bandwidth  of  either  6  or  12  MHz.  Teat  results  that  iliustraie  the 
performance  of  the  processor  in  the  recovery  of  magnitude  and  phaae  information  for  the  bispectrum  of 
quadraticaliy  related  signals  are  presented 


I.  Introduction 

The  bispectrum  is  an  example  of  a  class  of  functions 
called  higher -order  spectra  or  polyspectra.  Such  func¬ 
tions  are  useful  in  the  analysis  of  nonlinear  phenom¬ 
ena  and  nonlinear  systems.  Estimates  of  the  poly  spec¬ 
tra  (which  include  the  power  spectrum  as  a  special 
case)  are  defined  mathematically  as  expected  values 
of  successively  higher-order  products  of  Fourier  trans¬ 
forms.  For  the  case  of  the  bispectrum,  this  is  a  tnple 
product’; 

=  lim  r  f.  +  h)}.  ill 

where  X(f)  is  the  Fourier  transform  of  a  random 
process  xU)  over  a  finite  time  interval  T,  and  E[  ]  is 
the  expected  value  of  a  random  process.  Note  that  this 
definition  assumes  that  the  random  process  is  station¬ 
ary  to  the  necessar>  order.  Nonlinearities  result  in 
the  redistribution  of  a  signal’s  energy  among  various 
spectral  components  The  polyspectra  provide  a  means 
of  quantifying  the  resulting  correlation  among  the 
spectral  components.  The  bispectrum  is  particularly 
useful  for  the  analysis  of  quadraticaliy  nonlinear 
systems. 


The  authors  are  with  the  Department  of  Electrical  and  Com¬ 
puter  Engineering.  University  of  Texas  at  Austin.  .Austin,  Texas 
7S712. 

Received  2  January  1 9?  1 
0003-6935 .92  081015-1 5$05,00  0. 
c  1992  Optical  Societv  of  America 


Although  digital  implementations  of  the  polyspec¬ 
tra  have  found  numerous  applications  (several  of 
which  are  described  in  Section  II),  the  calculation  of 
these  quantities  can  be  quite  computationally  inten 
sive.  In  the  computation  of  the  discrete  bispectrum. 
the  formation  of  the  triple  products  of  discrete  Fou¬ 
rier  transforms  (DFT's)  requires  of  the  order  of  A'3 
complex  multiplies  per  second  for  real-time  operation 
(where  N  is  the  size  of  the  DFT  and  P  is  the  signal 
bandwidth}.  This  does  not  even  account  for  the  initial 
computation  of  the  DFT’s.  Thus  the  applications  to 
date  have  been  limited  to  higher-order  spectral  analy¬ 
sis  of  fairly  narrow-band  signals  and/or  non-real-time 
processing.  To  extend  the  techniques  of  higher-order 
spectral  analysis  to  wider  bandwidths  (10  MHz  and 
above)  in  real  time,  one  rapidly  outruns  the  capabili¬ 
ties  of  existing  general-purpose  digital  computers  and 
is  forced  to  look  for  alternative  methods  of  ii.iplemen- 
tation.  Other  constraints  (size,  weight,  power,  and 
cooling),  in  addition  to  processing  speed,  must  often 
be  considered  in  selecting  such  a  method.  In  this 
spirit,  optical  signal-processing  techniques  appear  to 
hold  promise  as  alternative  or  complementary  meth¬ 
ods  of  implementation  of  higher-order  spectra  for 
wide  bandwidth  signals. 

Optical  signal-processing  techniques  have  proved 
to  be  quite  powerful  in  performance  as  well  as  in 
flexibility.  One  of  the  key  advantages  that  these 
methods  offer  is  the  ability  to  perform  real-time 
spatial  Fourier  transforms  with  lenses.  The  ability  to 
convert  wide  bandwidth  time-varying  electrical  sig- 
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nals  into  space-varying  light-amplitude  distributions 
by  using  acousto-optic  (AO)  m^ulators  is  another 
important  attribute  of  optical  processing  systems. 
The  most  basic  signed-processing  function  imple¬ 
mented  with  acousto-optics  is  real-time  power- 
spectrum  analysis.  A  sampling  of  the  other  appli¬ 
cations  include,  optical  correlators,  convolvers, 
interference  rejection  filters,  and  optical  fast  Fourier 
transform  processors.*^  Two-dimensional  optical  pro¬ 
cessors  have  been  developed  to  compute  such  quanti¬ 
ties  as  the  ambiguity  function,’  the  cyclostationary 
function,  and  the  Wigner  distribution.^  A  noninterfer- 
ometric  triple-product  processor*  has  also  been  re¬ 
ported. 

An  optical  realization  of  the  bispectrum  was  devel¬ 
oped  at  the  University  of  Texas  by  Kauderer  et  al.‘ 
following  the  preliminary  studies  by  Florence  and 
Song.*  The  overall  optical  configuration  was  a  varia¬ 
tion  of  a  Mach-Zehnder  interferometer  (which  in¬ 
cluded  several  lenses  for  transforming  and  scaling) 
and  three  AO  modulators  (as  spatial  light  modula¬ 
tors).  The  propagation  of  the  diffracted  light  through 
the  system  of  lenses  and  the  superposition  of  the  two 
interferometer  paths  was  shown  to  generate  an  inten¬ 
sity  distribution  in  the  output  plane  involving  the 
same  product  of  Fourier  transforms  as  the  bispec¬ 
trum.  The  experimental  part  of  Kauderer’s  work 
provided  a  proof-of-principle  demonstration  of  the 
optical  bispectrum  processor.  The  tests  performed 
emphasized  the  measurement  of  the  magnitude  of 
the  bispectrum  (actually  the  bicoherence,  a  normal¬ 
ized  form  of  the  bispectrum)  for  sinusoidal  input 
signals  by  obse’^ng  the  interference  pattern  over  a 
localized  region  of  the  bifrequency  plame. 

Th?  cbjectivr  the  research  presented  in  this 
paper  was  to  develop  an  optical  processor  capable  of 
computing  both  the  magnitude  and  the  phase  of  the 
bispectrum  for  wide  bandwidth  input  waveforms.  For 
such  waveforms,  it  was  necessary  to  compute  the 
value  of  the  bispectrum  over  an  extended  region  of 
the  output  ibifrequency)  plane  rather  than  simply  for 
a  single  point.  Some  modifications  to  the  existing 
optical  architecture  were  necessary  to  satisfy  this 
requirement.  A  substantial  amount  of  effort  was 
directed  at  the  digital  processing  of  the  output  of  the 
optical  system  to  recover  the  magnitude  and  phase  of 
the  bispectrum.  The  interference  term  in  the  output 
intensity  distribution  consisted  of  the  real  part  of  the 
bisjjectrum  imposed  on  a  spatial  carrier.  This  term 
was  isolated  from  the  more  slowly  varying  noninter¬ 
ference  terms  by  digital  filtering.  A  discrete  Hilbert 
transform  approach  was  used  to  compute  the  imagi¬ 
nary  part  of  the  bispectrum  from  the  real  peu^.  The 
magnitude  and  phase  were  then  comput^  in  the 
normal  fashion.  The  processor  was  tested  with  four 
different  rf  signal  sets,  which  produced  four  different 
magnitude-phase  profiles  for  the  bispectrum.  The 
experimental  results  reported  here  are  in  good  agree¬ 
ment  with  expectations. 

In  the  next  section,  some  basic  definitions,  proper 
ties,  and  applications  of  the  bispectrum  are  discussed. 


Section  III  describes  the  optical  implementation  of 
the  bispectrum  including  the  optical  layout,  the  rf 
test  signal  generation  process,  and  the  initial  output 
image  processing  The  application  of  the  discrete 
Hilbert  transform  lo  the  problem  of  recovenng  bispec¬ 
trum  magnitude  and  phase  is  the  topic  of  Section  IV. 
Test  results  that  demonstrate  the  capability  of  the 
processor  to  recover  these  quantities  for  wide  band¬ 
width  input  waveforms  are  presented  in  Section  V, 
The  last  section  provides  a  summary, 

II.  Oaflnitions  and  Prof>*rUas  of  tha  Bispactrum 

Before  discussing  the  recent  developments  in  optical 
bispectrum  processing,  we  summarize  some  key  defi¬ 
nitions  and  properties  of  the  bispectrum.  It  is  well 
knowm  that  the  power  spectral  density  function  for  a 
wide-sense  stationary  random  process  can  be  defined 
as  the  Fourier  transform  of  the  correlation  function. 
The  higher-order  spectra  also  have  associated  higher- 
order  correlations,  which  are  related  to  the  spectra 
through  the  Fourier  transform.  Our  primary  interest 
is  in  the  second-order  correlation,  which  is  also 
known  as  the  bicorrelation,' 

■'>)  *  Elgitlfit  -  t,)f  it  ~  1,').  )2i 

where /■(/)  andg(/)  are  zero-mean,  real-valued  station¬ 
ary  random  processes.  Note  that  what  we  have  writ¬ 
ten  is  actually  the  cross  bicorreiation  The  cross 
bispectrum  can  then  be  expressed  as  the  Fourier 
transform  of 

/"i.  •  /  /  ^^*1.  T,)e*p[-2iry(  f,?,  dr,.  (3) 

or  directly  in  terms  of  the  Fourier  transforms  of  git) 
and  fit), 

Bgri  ~  UmT-'ElCi  f,  +  f,)]  i*> 

The  bispectrum  is  a  complex-valued  function  of  two 
frequency  variable.*:,  and  it  measures  the  correlation 
between  the  spectral  components  at  frequencies  /■, 
and  and  their  sum  frequency  f  -i-  caused,  for 
example,  by  a  quadratic  (square-law)  nonlinearity. 
Two  conditions  must  be  met  for  the  bispectrum  to  be 
nonzero.  Obviously,  the  signal  of  interest  must  have 
nonzero  spectral  components  at  each  of  the  three 
frequencies  /",,  f^,  and  Z",  =  /",  +  f-,-  This  condition  is 
referred  to  as  the  frequency  sum  rule.  The  second 
condition  requires  that  the  spectral  components  be 
statistically  dependent.  In  essence,  this  requirement 
can  be  satisfied  by  demanding  that  their  respective 
phases  obey  a  phase  sum  rule  given  by  =  4),  +  + 

constant.  Another  way  of  stating  this  is  that  the 
spectral  components  must  be  phase  coherent  for  the 
bispectrum  to  be  nonzero. 

It  is  possible  that  a  given  time  series  xiti  may 
contain  a  nonzero  spectral  component  at  f,  -  f  +  fi 
that  is  independently  generated  and  hence  not  corre¬ 
lated  with  the  componei.ta  at/",  and/j.  This  situation 
does  not  contribute  to  the  bisp^rum  at  fj  because 
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the  phaee  sum  rule  will  be  violated.  The  difference 
between  the  results  obtained  for  the  bispectrum  for 
quadraticaliy  related  (phase-coherent)  and  indepen¬ 
dently  generated  (incoherent)  components  provides  a 
powerful  means  of  distinguishing  the  two  situations. 
One  can  also  imagine  a  situation  in  which  there  is  a 
mixture  of  coherent  and  incoherent  energy.  For  such 
a  situation,  the  (cross)  bicoherence  function  can 
provide  useful  insight: 

The  bicoherence  ranges  from  zero  (completely  incoher¬ 
ent)  to  one  (completely  coherent). 

The  bispectrum  has  been  used  extensively  at  the 
University  of  Texas  at  Austin  for  the  treatment  of 
such  problems  as  plasma  fluctuations,"  transition  to 
turbulence  in  subsonic  airflow,*"  the  nonlinear  re¬ 
sponse  of  moored  vessels  or  tension  leg  platforms  to 
random  sea  excitation,'*  and  nonlinear  system  model¬ 
ing  in  the  frequency  domain.*"  Others  have  applied 
these  methods  to  the  enhancement  of  atmosphere- 
degraded  infrared  astronomical  observations,'**  to  the 
study  of  nonlinear  relationships  in  electroencephalo¬ 
grams,**  and  for  emitter  characterization  in  holo¬ 


Path  B 


graphic  sonar.*"  '*  Some  of  these  applications  make 
use  of  the  fact  that  the  bispectrum  is  zero  for 
zero-mean  additive  Gaussian  processes  in  order  to 
enhance  a  non-Gaussian  signd  of  interest  in  the 
presence  of  Gaussian  noise.  None  of  these  applica¬ 
tions  is  particularly  stringent  in  terms  of  its  process¬ 
ing  requirements.  Thus  a  distal  implementation  on 
general-purpose  hardware  has  been  the  method  of 
choi(». 

III.  Optical  Implamanlatton  of  the  Blapectnim 

As  stated  in  the  introductory  section,  the  primary 
objective  of  this  research  is  to  compute  the  magnitude 
and  phase  of  the  bispectrum  over  an  extended  region 
of  the  bifrequency  plane  for  wide  bandwidth  signals 
This  section  describes  the  implementation  of  the 
optical  bispectrum  processor  in  terms  of  the  basic 
optica]  architecture,  the  circuit  for  rf  test  signal 
generation,  test  case  selection,  and  image  acquisition 
and  initial  processing 

A  Optical  Layout 

Figure  1  shows  the  current  optical  architecture  used 
to  implement  the  bispectrum.  The  two  modulators  in 
path  A  have  a  40-MHz  center  frequenej-  and  a  20- 
MHz  bandwidth,  while  the  third  AO  cell  has  a 


Pig  I  Optical  layout  for  the  biSpectrum  processor  Inset:  Orientation  of  the  AO  cells 
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70-MHz  r  iiter  frequency  and  a  40  MHz  bandwidth 
Baae^  id  signals  f  (/)  and  g(t)  are  upahifled  into  the 
passbands  of  the  modulators  The  upshifted  signals 
are  denoted  by  f’\t)  and  g’W.  Each  AO  cell  has  a 
maximum  time  aperture  of  5  ns,  which  results  in  a 
minimum  frequency  resolution  of  200  kHz.  It  should 
be  noted  that  the  processor  depicted  in  the  figure  is 
capable  of  computing  the  cross  bispectrum  between 
the  signals  f  ‘(t)  and 

In  each  of  the  interferometer  paths,  the  light  from 
the  He-Ne  laser  is  expanded  and  recollimated  The 
spherical  and  cylindrical  lenses  are  positioned  with 
respect  to  the  modulators  and  oriented  so  that  the 
light  comes  to  a  focus  as  a  line  inside  the  cells.  The 
inset  of  Fig.  1  shows  the  orientation  of  each  of  the 
three  modulators  as  viewed  from  the  output  end  of 
the  interferometer.  In  Path  A,  the  first  modulator  is 
oriented  horizontally.  The  first  diffracted  order  of  the 
horizontal  cell  illuminates  the  second  modulator, 
which  is  oriented  vertically. 

Although  the  basic  optical  architecture  (a  modified 
Mach-Zehnder  interferometer)  remains  the  same  as 
that  used  by  Kauderer  et  al.,^  the  present  effort  to 
generalize  the  technique  to  allow  optical  bispectrum 
processing  of  wide  bandwidth  waveforms  requires  a 
modification  in  the  Path  B  portion  of  the  layout. 
Specifically,  the  third  AO  modulator  is  oriented  at  a 
45-deg  angle,  and  its  output  is  scaled  by  ^2  with 
respect  to  Path  A  to  provide  the  correct  overlap 
between  transforms  over  an  extended  region  of  the 
bifrequency  plane.  This  is  where  the  present  layout 
differs  from  that  used  by  Kauderer.  The  new  configu¬ 
ration  for  the  third  modulator  and  the  imag^  scaling 
ensure  that  the  frequency  sum  rule  is  satisfied  and 
that  efficient  use  is  made  of  the  available  light. 

In  our  discussion  of  the  bispectrum,  it  is  more 
natural  to  talk  in  terms  of  temporal  frequency  (hertz 
or  megahertz)  rather  than  spatial  coordinates.  Thus, 
we  will  adopt  the  symbols  and  as  representing  the 
two  temporal  frequency  variables  (in  megahertz)  that 
have  been  mapp^  onto  a  spatial  coordinate  system 
iu,v)  according  to  /",  =  uvjkf,  and  -  vvjkf„  where 
u,  is  the  acoustic  velocity  in  the  AO  cells,  X  is  the 
optical  wavelength  in  free  space,  and  /j  is  the  focal 
length  of  the  transforming  lens.  Considering  only  the 
first  diffracted  orders  from  the  modulators  and  assum¬ 
ing  a  plane-wave,  Gaussian  amplitude  profile  illumi¬ 
nation,  we  can  write  the  output  amplitude  distribu¬ 
tion  from  Path  A  as^ '' 

F'i  fi.tiF  '  f,.  t'  =  j  expt  -2-:^ j  f,x  r.' 

X  expi -x^  20^1 /" 'll  -  tV'di  /  e*p( -2ir_/ 

X  exp(->’,'2v')/’'(y  -  l6) 

and  from  Path  Bas 

=  /  exp(2T:_/i  /■,+/'j)x  L,iexpi-x’/2(T’l/f  '(x  -  vj)dx.  (7) 

where  we  have  suppressed  the  optical  plane-wave 


factors  (i.e.,  exp(  -  k  D]  and  constant  factors 
outside  the  integrals  Note  that  in  Eq  (7)  the  varia¬ 
tion  of  the  Gaussian  beam  in  the  direction  perpendic¬ 
ular  to  the  transform  direction  is  assumed  to  be  slow 
enough  over  the  region  of  interest  i  =  fj  '  that  this 
factor  can  be  approximated  by  unity.  If  we  include  the 
0,  ‘  factor  in  the  argument  of  the  transforms,  we  can 
express  the  transforms  in  spatial  frequency  units  We 
have  suppressed  this  factor  for  now  in  order  to  make 
the  connection  with  the  temporal  frequencies  clearer. 
Prior  to  recombining  the  beams  from  Paths  A  and 
B,  we  need  to  reintroduce  the  plane-wave  factors  that 
go  with  them.  The  intensity  measured  in  the  output 
plane  is  given  by  the  square  of  the  sum  of  the 
amplitudes.  The  charge-counled  device  (CCD)  camera 
placed  in  the  output  plane  integrates  this  intensity 
distribution  over  a  fairly  long  time  interval,  resulting 
in  a  measured  output  intensity  given  by 

/«.  «  +  :C  (A 

2  Re'G  (  /■,  +  ftiF  ’i  f,iexplji>i  (,.  f,i]  ,  (8) 

where  the  previous  time  dependence  has  been  can¬ 
celed  directly  or  averaged  out.  The  complex  exponen¬ 
tial  in  the  last  term  represents  the  spatial  dependence 
of  the  interference  fringes.  The  phase  factor  is  a 
function  of  the  curvature  of  the  interfering  optical 
wave  fronts  and  the  angle  between  the  propagation 
vectors.  For  present  purposes,  this  is  assumed  to  be  a 
linear  function  of  the  spatial  coordinates,  giving  rise 
to  straight  fringes  of  uniform  spacing  (i.e.,  a  constant- 
frequency  spatial  carrier).  Note  that  the  last  term  in 
the  expression  has  the  same  form  as  the  cross  bispec¬ 
trum.  while  the  first  two  terms  represent  the  inten¬ 
sity  of  the  light  from  each  path  in  the  absence  of  the 
other.  Figure  2  indicates  the  resulting  intensity  distri¬ 
bution  in  the  interference  plane  for  typical  input 
signals  and  the  projections  of  their  transforms  on  the 
appropriate  axes.  The  presence  of  a  nonzero  bispec- 


Fip  2  Output  inten-Sity  distribution  and  projection#  of  the  input 
spectra 
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tnim  will  result  in  the  appearance  of  interference 
frin^s  in  the  output  image.  The  fringe  contrast  is 
^verned  by  the  relative  amount  of  power  present  in 
the  coherent  <  quadraticaliy  related*  and  incoherent 
(independently  generated)  signal  components  that 
were  used  to  drive  the  AO  cells. 

B.  Rf  Input  Circuitry 

The  task  of  generating  the  rf  input  signals  to  test  the 
AO  bispectrum  processor  effectively  requires  a  high 
degree  of  flexibility  and  control  over  the  waveform 
parameters  as  well  as  a  relatively  wide  bandwidth. 
The  bispectrum  measures  the  correlation  between 
difieient  spectral  components  satisfying  the  fre¬ 
quency  sum  rule  In  testing,  it  is  desirable  to  be  able 
to  specify  this  correlation  as  we  see  fit,  that  is,  to 
introduce  signals  related  by  an  arbitrary  quadratic 
transfer  function  Furthermore,  the  signals  of  inter¬ 
est  should  be  of  the  order  of  10  MHz  in  bandwidth 
(for  our  particular  system).  It  is  also  helpful  if  two 
different  signals  can  be  produced  simultaneously 

The  above  requirements  were  met  by  the  Hewlett- 
Packard  HP8770A  arbitrary  waveform  synthesizer 
(AWS).  The  HP8770A  is  the  source  for  aU  the  signals 
used  to  drive  the  AO  modulators  Fig.  3).  An 
arbitrary  digital  sequence  representing  samples  of 
the  desired  waveform  is  downloaded  from  a  microcom¬ 
puter  to  the  AWS  by  a  GPIB  interface  (a  general- 
purpose  interface  bus.  conforming  to  the  IEEE-488 
standard)  .  In  turn,  the  AWS  converts  the  sequence 
(12-bit,  125-  MHz  sample  rate)  to  an  analog  signal 
with  a  bandwidth  of  up  to  50  MHz.  The  maximum 
output  is  2  V  peak  to  peak,  and  the  output  impedance 
is  50  Cl.  In  edition  to  the  standard  signal  output,  the 
AWS  produces  a  10-MHz  sinusoidal  reference  signal 
that  is  phase  locked  with  its  125-MHz  clock. 

In  our  experiments,  the  Path  A  and  Path  B  drive 
signals  were  generated  simultaneously  by  a  single 
AWS  by  means  of  frequency  multiplexing.  The  Path  A 
drive  signal  f'[t)  was  generated  directly  [i.e.,  the 
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Fig  3  Rf  signal  generation  circuit. 


baseband  signal  fit)  was  upshifled  into  the  30-50- 
MHz  passband  of  the  AO  cells  in  the  signal  definition 
process).  This  was  then  added  to  the  b^band  signal 
git),  producing  the  AWS  output  signal 

xdi  -  /■'</>  -»■  gu>  •9- 

This  signal  was  split  and  sent  to  both  rf  dnve  circuits 
The  Path  A  drive  voltages  were  produced  by  bandpass 
filtering  (30-50-MHz  passband)  xit)  and  amplifvTng 
the  resulting  signal.  Producing  the  Path  B  drive 
signal  was  somewhat  more  involved  since  g(/)  must 
be  placed  in  the  passband  of  the  third  AO  cell  The 
AWS  output  signal  was  mixed  with  an  80  MHz  local 
oscillator,  then  bandpass  filtered  to  isolate  the  signal 
of  interest  before  rf  amplification  It  is  important  to 
rote  that  the  80-MHz  carrier  used  must  be  phase 
coherent  with  the  AWS  output  signal  in  order  to 
maintain  coherence  between  the  signals  and 
g'(t).  To  achieve  this,  the  10-MHz  sine-wave  refer¬ 
ence  signal,  which  is  provided  by  the  AWS,  wa.- 
doubled  three  times,  resulting  in  an  SO  MHz  signal 
with  the  proper  phase  relationship. 

Figure  4  shows  the  spectra  of  the  Path  A  and  Path 
B  drive  voltages  and  the  origintil  AWS  output  signal 
The  shaded  regions  represent  the  spectra  of  t.ie 
signals,  and  the  modulator  passbands  are  indicatini 
by  dotted  lines  The  unshaded  regions  represent 
missing  upper  sidebands  In  the  case  of  F  (  /  !,  the 
sideband  was  suppressed  digitally  in  the  definition  of 
the  signal.  For  G‘{  f),  the 60-80-MHz  bandpass  fitter 
supprei  sed  the  upper  sideband,  which  was  produced 
as  a  result  of  the  rf  mixing  process.  We  think  of 
signals  F'if)  and  G'if)  as  representing  the  lower 
sidebands  of  a  pair  of  baseband  signals  placed  on  40- 
and  80-MHz  carriers,  respectively.  The  spectral  con¬ 
figuration  of  F '  and  G '  was  selected  to  achieve  the 
phase-coherence  requirement  as  well  as  to  optimize 
the  position  of  the  spectra  in  the  modulator  pass- 
bands,  to  help  Mdth  dynamic  range  considerations, 
and  to  satisfy  the  frequency  sum  rule. 

C.  Test  Signal  Definition 

At  this  stage  of  our  -esearch,  it  is  necessary  to  balance 
the  desire  to  process  more  complicated  test  signals 
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Fig  4  .Spectra  of  the  AWS  output  signa)  (ai  and  the  AO  cell  input 
aignalstb)  and  Ic). 
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with  the  practical  need  to  have  signals  that  will 
produce  an  understandable  optical  output  when  their 
bispectrum  is  computed.  In  order  to  satisfy  these 
constraints  and  test  the  wide  bandwidth  processing 
capability  of  the  optical  implementation,  the  single- 
tone  tests  used  by  Kauderer  et  a/.’  have  been  replaced 
With  tests  based  on  sets  of  uniform  amplitude,  closely 
spaced  tones,  whose  frequencies  and  phases  are  to  be 
specified  This  approach  is  attractive  from  several 
perspectives.  In  particular,  the  mathematical  analysis 
of  the  problem  and  the  digital  sequence  generation 
remain  relatively  simple. 

Each  of  the  test  cases  presented  here  was  based  on 
a  single  choice  otfU)  representing  the  input  to  some 
quadratically  nonlinear  system.  The  output  of  the 
system,  was  related  to/" (t)  according  to 

gm  -  /i,u  -  f,.  1  -  1,1 ilO) 

where  h,  was  the  desired  quadratic  impulse  response. 
In  the  frequency  domain,  this  became 

^ /'  ' 

where  /"j,  fj)  was  the  quadratic  transfer  func¬ 
tion. 

The  signal  fit)  consisted  of  a  sum  of  30  unifonm 


amplitude,  oonstant-phaae  sinuaoida; 

t  **  : 

/■if)  •  R»l/(|t]  -  (12^ 

where it  200  klij  Tliis  gave  fit)  an  overall 
bandwidth  of  6  MHz.  Note  that  this  choice  of  fits 
could  also  be  d^cribed  at  t  term  of  tmc  pultee  5  iis 
apart  (see  Fig.  5(a)).  Thut  the  problem  could  be 
restated  as  one  of  invesUgating  the  response  of  a 
quadratically  nonlinear  i^etem  to  an  impulsive  input 
probe  signal — an  approach  commonly  used  in  many 
physical  situations. 

In  the  frequency  domain,  the  complex  signal  f  (ti 
became 

m 

hfi-'Zhif- f.)  Sl3i 

Since  the  tones  had  constant  ampbtudes  and  phases 
and  were  spaced  at  the  minimum  resolvable  spacing 
for  the  AO  cells,  the  optical  output  ,F'if.sF'i  f^)  ' 
was  a  uniform  square  region  (6  MHz  x  6  MHz)  in  the 
bifrequency  plane. 

With  the  signal  f  it  )  specified,  the  remainder  of  the 
signal  synthesis  process  involved  the  selection  of  the 
quadratic  transfer  function  //,( f,  /",).  used  to  pro¬ 
duce  git)  from  fit).  It  is  useful  to  begin  m  the 
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Fig  5 
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IS)  Definition  of  fir  i.  (bi  relationship  between  ^  if )  and  gif  s 
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frequency  domain  with  Eq.  (11)  and  substitute  >n  F 
from  Eq.  (13): 

h  .% 

/■ )  -  2  f-  /■-/■.  -  A)  114) 

In  the  time  domain  this  becomes 

I  »i  V  I 

S>«>  «  Re(i(/))  »  ^  f^.  fJexp[2Ttji  f,  +  A*<11  ns.i 

i-.i  ..1  I 

Note  that  is  evaluated  only  at  certain 

discrete  frequencies. 

The  actual  quadratic  transfer  functions  (QTF’s) 
used  can  be  modeled  with  a  square-law  operation 
followed  by  a  linear  system.  This  is  depicted  in  Fig. 
5(b).  The  specific  QTF’s  chosen  do  not  necessarily 
represent  r^  physical  systems.  Mathematically,  they 
can  be  written  as  a  product  of  two  factors; 

Wt  L.  A I  -  L  +  /■,»«<,(  L.  fj.  <161 


The  factor  will  be  taken  to  be  unity  for  all 
frequency  pairs  /",  and  /j.  Considered  by  itself,  this 
corresponds  to  a  simple  square-law  relation;  ^(t)  * 
f(t  )^  This  results  in  a  triangular  spectrum  represent¬ 
ing  convolution  in  the  frequency  domain,  G(f)  = 
F{  f)  •  Fi  f),as  Fig.  5(b>  in^cates. 

For  the  term  four  different  linear  transfer 
functions  (LTF’s)  are  used.  Figures  6(a)-6(d)  depict 
the  magnitude  and  phase  profiles  for  Fi  f )  and  Gif) 
associated  with  each  LTF.  The  first  LTF  is  unity  and 
produces  the  tnangular  spectrum  (a)  just  described. 
The  second  LTF  normalizes  the  magnitude  of  the 
output  of  the  square-law  device,  returning  it  to  a 


fa) 


Off) 

In  1 1 — 1 


Fig  6  SpecUa]  profiles  for  test  signali;  related  by  various  QTF's;  lai 
square  law.  lb)  normalized  magnitude,  (c)  normalized  magnitude 
and  quadratic  phase,  and  fd)  normalized  magnitude  and  step 
phase. 


rectangular  shape  (b)  It  has  the  form 
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The  normalization  step  is  intended  to  facilitate  the  rf 
diagnostics,  simplify  the  optical  output,  and  lessen 
the  image  acquisition  dynamic  range  requirements 
The  third  and  fourth  LTF’s  also  have  magnitude 
responses  given  by  They  differ  from  the  second 
LTF  in  their  phase  response.  The  first  and  second 
LTF’s  have  constant-phase  profiles  while  the  third 
and  fourth  have  quaorstic  (c)  and  step-phase  profiles 
tdi,  respectively. 

0  Image  Acquisdion  and  Initial  Processing 

The  images  formed  in  the  output  plane  were  acquired 
byusinga512  x  512  pixel  CCD  camera  and  microcom 
puter-based  video  frame  grabber  The  relatively  slow 
frame  rate  (30  frames/s)  was  consistent  with  our 
assumption  of  a  long  integration  time.  The  images 
were  digitized  to  8-bit  resolution  In  the  tests  pre¬ 
sented  here,  the  6  MHz  x  6  MHz  region  of  interest  in 
the  bifrequency  plane  sfianned  roughly  a  128  »  ]28 
pixel  subframe.  Although  the  bispectrum  is  a  func- 
tion  of  position  and  the  fringes  have  a  spatial  depen¬ 
dence,  the  images  acquired  in  this  series  varied  slowly 
enough  that  the  spatial  sampling  rate  was  not  a 
problem.  We  make  use  of  this  property  in  the  discus¬ 
sion  below 

Before  continuing  the  discussion  of  the  recovery  of 
bispectral  magnitude  and  phase,  it  is  appropriate  tt- 
make  the  connection  between  the  information  present 
in  the  output  plane  and  the  equivalent  digiul  imple¬ 
mentation  of  the  bispectrum.  The  AO  cells  used  in 
our  experiments  had  S-us  time  apertures  Thus  the 
effective  time-bandwidth  product  for  the  6-MHr 
subband  was  30.  Equating  this  to  the  number  of  DFT 
bins  for  a  digital  implementation  means  that  the 
effective  processing  rate  for  the  formation  of  triple 
products  of  DFT’s  was  of  the  order  of  (6  MHz  x  30  = 
180  million  complex  multipbes/s).  This  was  signifi¬ 
cant  but  not  overwhelming.  The  potential  of  the 
optical  realization  of  the  bispectrum  is  more  dramati¬ 
cally  illustrated  by  considering  a  processor  employing 
the  most  powerful  commercially  available  AO  modula¬ 
tors  with  bandwidths  of  the  order  of  1  GHz  and 
time-bandwidth  products  of  the  order  of  1000  Effec¬ 
tive  processing  rates  in  the  trillion  complex  multi- 
plies/s  range  seem  possible. 

The  magnification  of  the  output  image  was  such 
that  128  pixels  roughly  corresponded  to  a  6- MHz 
span.  For  subsequent  processing,  a  more  accurate 
spatial-to-frequency  calibration  must  be  obtained  It 
was  also  necessary  to  compensate  for  inversion  of  the 
image  and  the  nonunity  aspect  ratio  of  the  CCD  array 
(it  is  wider  than  it  is  high).  A  calibration  test  involv¬ 
ing  a  set  of  four  tones  separated  by  2  MHz  was 
included  as  a  part  of  the  standard  data  acqui.sition 
process.  The  signal  was  used  to  drive  the  two  Path  A 
modulators.  The  output  image  took  the  form  of  a  4  x 
4  grid  of  intensity  peaks  that  spanned  the  6  MHz  x  6 
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MHs  region  of  interest.  The  intensity  mssims  were 
located  and  associated  with  the  appropriate  pair  of 
temporal  frequencies.  Two  points  on  the  extreme 
comers  were  used  to  determine  the  slope  and  int^- 
cept  for  the  pixel-to-frequency  conversion  for  each 
dimension.  Such  a  measurement  was  made  each  time 
anything  affecting  the  calibration  changed,  including 
image  magnihcation  and  fringe  orimtation. 

A  256  X  256  pixel  subframe  (roughly  12  x  12  IdHx) 
was  extracted  from  the  original  image  frame  and  was 
scaled  by  using  the  calibration  information  described 
m  the  previous  paragraph.  In  the  process,  the  original 
subframe  was  mapped  into  a  new  one  properly  ori> 
ented  and  scaled.  Four  adjacent  pixels  of  the  origina] 
subframe  were  averaged  together  to  form  a  sin^ 
output  pixel.  Thus  the  new  subframe  had  a  dimension 
of  128  X  128  pixels  corresponding  to  12  x  12  MHz  in 
the  bifrequency  plane.  This  last  step  can  be  viewed  as 
a  low-pass  filter  and  decimate  operation  that  serves  to 
smooth  the  output  as  well  as  to  reduce  the  image 
processing  requirements. 

E  Interference  Term  Isolation 

The  interference  image  subframes  were  subsequently 
processed  to  isolate  the  interference  term  (which 
contains  the  bispectrum  information)  and  to  recover 
magnitude  and  phase  information.  The  int«uify 
distribution  observed  was  as  predicted  in  Eq.  (8) 
except  that  the  effect  of  additive  background  zmiae 
must  be  included.  The  expression  then  bi^mes 

ifTl’  iO'l’  +  2R«(F'*f(V‘)  +  (18) 

where  n,(  /*„  /*,)  is  the  background  term.  In  order  to 
isolate  the  bispectrum  information  contained  in  the 
interference  fringes,  the  noninterference  and  back¬ 
ground  noise  contributions  to  the  total  intmiaity 
must  be  subtracted  or  otherwise  filtered.  After  divid¬ 
ing  the  result  by  two,  we  are  left  with  the  desired  term 
representing  the  real  part  of  the  cross  bispectrum  on 
a  spatial  carrier: 

4,< -  B«rfF  *(/-,)F-(/i)G’(/-.  +/;)e-J.  (18) 

The  approach  to  isolating  the  interference  term  is 
depicted  in  Fig.  7(8).  It  differs  from  the  method  used 
by  Kauderer  et  al.,  which  relied  on  the  sequential 
acquisition  of  interference,  noninterference,  a^  back¬ 
ground  frames.  The  interference  term  containing  the 
bispectrum  information  is  isolated  by  subtracting  the 
other  frames.  There  are  two  princi|^  disadvantages 
to  this  method;  (1)  the  ne^  to  acquire  multiple 
frames  and  (2)  the  frame-to-frame  independence  of 
the  noise  that  effectively  increases  the  noise  variance 
of  the  final  frame.  The  new  method  uses  the  spectral 
characteristics  of  the  images  in  the  present  test  set  to 
accomplish  the  isolation  step  based  solely  on  the 
interference  frame.  Even  after  the  decimation  step 
that  atxompanies  the  scaling  of  the  images,  the 
interference  images  are  still  oversampled  ^  more 
than  two  in  each  dimension.  The  first  step  in  the  new 
isolation  technique  is  to  average  another  four  pixeb 
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Fig.  7.  (a)  Flow  chart  of  the  intarforanot  tam  laolation  ptoota*. 
(b)  croas  aaelion  of  aiinulatad  ixUaHarcoot  uoafc  bafora  the  daciiDa- 
tioo  atap,  (c)  aftar  tfaniaation  atap,  and  (d>  aftar  K)gfa-pa« 
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(two  by  two)  into  a  single  pixel  and  thus  to  (uoduce  a 
new  64  X  64  |^1  inmip.  This  represents  a  second 
k>w-paaa  filter  and  dedmatitm  si^.  The  effect  of 
decimation  on  the  interference  image  is  illustrated  in 
Figs.  7(b)  and  7(c),  whid)  ehow  simulated  one- 
dimenaional  slices  of  the  trsnaform  of  Uw  image 
before  and  after  the  decimation  atep.  Initially,  the 
spectrum  of  the  image  is  compreaaed  near  the  origin. 
The  decimation  operatkm  ex|wnds  the  transform  in 
ttmoi  of  the  nonnsdizsNl  frequency. 

There  is  sntAher  fsature  that  is  illustrated  in  Figs. 
7b  and  7c.  The  interference  and  noninterference 
terma  are  shown  as  sepmte  in  the  frequency  domain. 
Thia  ia  becauae  the  mterferenoe  frhiges  act  as  a 
spatial  carrier  for  the  biapectrum  information.  De¬ 
pending  on  the  spatial  fluency  of  the  fringes  and 
the  bai^width  of  the  bisj^ruro,  the  interference 
term  may  or  may  not  be  separate  from  the  noninter¬ 
ference  terms.  The  condition  for  sufficient  separation 
is  that  the  spatial  carrier  be  greater  than  the  sum  of 
half  of  the  bandwidth  of  the  noninterference  terms 
plus  half  of  the  bandwidth  of  die  interference  term. 
Fortunate^,  our  data  aet  includes  images  Uiat  satisfy 
this  condition.  This  brings  us  to  the  second  st^  of  the 
image  isolation  prooeaa.  The  interference  frame  is 
high-pass  filtered  to  remove  the  relatively  slowly 
varying  noninterference  terms  and  leave  the  bispec¬ 
trum  information  (on  a  spaUal  carrier).  The  resulting 
intensity  distribution  is  given  by 

*  f,)r^]  f  »i,.  (201 
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where  n,  is  the  modified  background  noise  from  the 
interference  frame.  The  high-pass  filter  is  imple¬ 
mented  using  a  19-tap,  finite  impulse-response  filter 
that  is  convolved  with  each  row  and  each  o)lumn  of 
the  interference  image.  The  filter  was  designed  by  a 
simple  frequency’  sampling  method.**  Its  magnitude 
response  is  shown  in  Fig.  7(d),  superimposed  on  the 
cross  section  of  the  simulated  image  sp^rum.  It  by 
no  means  represents  the  optimal  choice.  However,  it 
is  adequate  for  present  purposes,  as  the  results  that 
follow  show. 


iV.  Magnitude  and  Phase  Recovery 

In  Section  I  it  was  stated  that  the  bispectrum  is 
complex  valued  and  that  a  chief  objective  of  this 
research  was  to  recover  magnitude  and  phase  informa¬ 
tion  from  the  optical  realization  of  the  bispectrum  for 
wride  bandwidth  waveforms.  The  interference  term 
that  has  been  isolated  represents  only  the  real  part  of 
the  bispectrum  We  have  devised  a  method  to  com¬ 
pute  the  imaginary’  part  of  the  bispectrum  from  the 
real  part  by  performing  a  two-dimensional  discrete 
Hilbert  transform  This  section  deals  with  the  theory 
and  implementation  of  the  method. 

A.  Hilbert  Transiorm 

The  Hilbert  transform  relates  the  real  and  imaginary 
parts  of  a  complex -valued  quantity.  In  analog  signal 
theory,  the  Hilbert  transform  is  encountered  in  con¬ 
junction  with  the  theory  of  analytic  signals.**  It  is 
interesting  and  perhaps  not  too  surprising  that  the 
concept  of  the  analytic  signal  was  attributed  to  Ga¬ 
bor,**  who  also  was  responsible  for  early  work  in 
holography.** 

In  electromagnetic  theory,  the  Hilbert  trairsform 
concept  is  seen  in  terms  of  the  Kramers-Kronig 
relations  between  the  real  and  imaginary  parts  of  the 
dielectric  constant.**  These  follow  as  a  result  of  the 
causal  relation  between  the  electric  displacement  and 
the  electric  field.  In  linear  system  theory,  the  Hilbert 
transform  relates  the  real  and  imaginary  parts  of  the 
linear  transfer  function  under  the  constraint  that  the 
impulse  response  is  causal.'* 

A  discussion  of  the  analytic  signal  will  aid  in  our 
understanding  of  the  Hilbert  transform.  We  can 
derive  a  complex-valued  signal  from  a  real-valued 
bandpass  signal  x\t)  as  follows: 


X  by  -j  8gn(  f ).  That  is, 

k(f)»  ~j»gniftXifi  mi 

The  spectrum  of  the  analytic  signal  is  given  by 

Xin‘2X(n  for  f<0. 

X( /■)  "  0  for  f  <  0  (24) 

Thus  in  order  to  produce  a  complex-valued  signal 
from  a  real-valued  signal,  one  merely  sets  the  nega¬ 
tive  part  of  the  spectrum  to  zero,  multiplies  the 
positive  side  by  two,  and  performs  the  inverse  trans¬ 
form- 

f  2ulf)Xl.f)txpl2-ajtf)df  (2S) 

where  uif)  is  a  unit  step  function  in  the  frequency 
domain. 


B.  The  Bispectrum  and  the  Hilbert  T ransform 

This  suggests  an  approach  to  the  problem  of  bispec¬ 
trum  magnitude  and  phase  recovery.  The  question 
becomes:  can  the  real  part  of  the  bispectrum  be 
Hilbert  transformed  to  produce  the  required  imagi¬ 
nary  part?  To  understand  this  more  clearly,  we  must 
invoke  the  Fourier  transform  relation  brtween  the 
bispectrum  and  the  bicorreiation.  Writing  the  real 
and  imaginary  parts  explicitly  we  have 

“  // fif,  +  /ir,))  ♦  R,  im(2iK  Ar,  +  Ar,))|dr,dr„  (26) 

•  //  |R/  C0«(2ir(  Ai,  +  A*?))  -  fi,  •in(2it<  Afi  ♦  A(i>lid<,ilri,  (27) 

where /I(f„  f,)  *  f,)  is  the  bicorreia¬ 

tion  that  may  be  compla  v^ued.  The  Hilbert  trans¬ 
form  of  Re[B(  /■„  /■,)]  is  given  by 

(28) 


Note  that  this  is  a  Hilbert  transform  in  the  frequency 
variable  f,.  The  last  expression  can  be  rewritten  as 


(A  4- A*) 
2 


+ 


R,(A  -  A*)! 

2.  i 


(29) 


iio  ■  xlt)  +  JxU), 


(21)  where 


where  the  imaginary  part  is  the  Hilbert  transform  of 
the  real  part; 


.  1  f«p[2Tr^(r<,  +  A<l»i  ... 

__  if 


(30) 


Note  that  the  Hilbert  transform  takes  a  function  of  a 
given  variable  into  another  function  of  that  same 
variable.  In  the  frequency'  domain,  the  Hilbert  trans¬ 
form  corresponds  to  multiplication  of  the  spectrum  of 


This  integral  can  be  evaluated  by  using  contour 
integration.  The  contour  is  tstken  along  the  real  axis 
and  around  an  infinite  semicircle  in  the  upper  half 
complex/'  plane.  For  the  integral  to  exist,  t,  must  be 
greater  than  zero.  This  is  the  causality  requirement 
associated  with  the  Hilbert  transform.  When  we 
include  the  contribution  at  the  pole  on  the  real 
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axis,  we  find 


M  j  «pl2'trj( <31) 

where  is  a  unit  step  function  in  the  time  domain 
Substituting  this  back  into  the  expression  for  the 
Hilbert  transform  of  the  real  part  of  the  bispectrum, 
we  obtain 

//  -A,  8ml2'ir( /".i,  + +  RiCXialZtti  f,l, 

(32) 

This  equals  the  imaginary  part  of  the  bispectrum 
under  the  constraint  that  Ru^,  tj)  =  0  for  tj  <  0.  A 
similar  Hilbert  transform  relation  exists  between  the 
real  and  imaginary  parts  of  the  bispectrum  if  the 
bicorrelation  is  zero  for  <  0.  The  implications  of 
these  constraints  on  the  bicorrelation  can  be  seen  by 
writing  it  explicitly  in  terms  of  the  signals  /  U)  and 
gU): 

U'>  •  Elgitifit  ~  /,)/■(/  -  (,)j.  (331 

Treating /(i)  as  the  input  to  some  system  andg(r)  as 
the  output,  the  constraints  imply  that  the  present 
outputs  cannot  be  correlated  with  future  inputs.  This 
is  a  sensible  requirement  that  agrees  with  our  con¬ 
cept  of  causality.  Since  physically  realizable  signals 
must  be  causal,  this  does  not  appear  to  be  a  signifi¬ 
cant  restriction. 

C.  Approach  to  Magnitude  and  Phase  Computatton 

Several  authors  describe  various  Hilbert  tranaform- 
related  approaches  to  phase  recovery  from  general 
interference  patterns.  Each  of  these  methods  (and 
ours  as  well)  requires  that  the  spectra  of  the  noninter¬ 
ference  and  interference  fringe  terms  be  separable  in 
the  Fourier  transform  of  the  interference  image. 
Further,  in  each  approach,  a  new  image  is  produced 
in  which  one  of  the  two  sides  of  the  spectrum  of  the 
fringes  is  isolated  from  the  other  spectral  compo¬ 
nents. 

Takeda  et  al.^  introduce  a  fast  Fourier  transform- 
based  method  of  general  fringe  pattern  analysis  in 
which  the  one-sided  fringe  sp^rum  is  simulta¬ 
neously  isolated  and  translated  to  the  origin  in  the 
frequency  domain  of  the  image  and  then  inverse 
transformed.  They  illustrate  the  method  for  a  one¬ 
dimensional  fringe  pattern.  Macy“  compares  the 
Fourier  transform  approach  to  an  approximate  ver¬ 
sion,  called  the  sinusoidal  fitting  method,  and  evalu¬ 
ates  the  results  for  two-dimensional  phase  variations. 
In  a  process  called  quadrature  multiplicative  moire, 
Womack“  multiplies  the  interference  pattern  by  a 
cosine  reference  image  and  low-pass  filters  the  result 
to  isolate  the  difference  frequency  term.  He  repeats 
the  process  with  a  sine  reference  image.  K«is** 
computes  the  discrete  Fourier  transform  of  an  inter¬ 
ference  image,  zeros  out  half  of  the  spectrum,  and 
inverse  transforms  this  to  produce  a  complex-valued 
image  still  on  a  spatial  carrier. 

A  flow  chart  of  our  method  to  compute  the  magni¬ 


tude  and  phase  of  the  bispectrum  by  the  Hilbert 
transform  is  shown  in  Fig.  8.  The  methcid  involve*  the 
convolution  of  the  interference  image  with  the  finite 
impulse  response  sequence  for  the  Hilbert  transform 
The  Hilbert  transformer  is  a  15-tap  finite  impulse 
response  filter  with  alternating  zero  coefficients  ‘‘ 
The  magnitude  and  phase  responses  for  the  trans¬ 
former  aue  shown  in  Figs.  9(a)  and  SKb),  respectively 
Note  that  the  magnitude  rwponse  has  a  bandpass 
character  and  the  phase  reaponae  exhibit*  the  re¬ 
quired  -j  8gn(  /■)  profile. 

We  designate  the  isolated  interference  term  that 
represenu  the  real  part  of  the  bispectrum  as  B,  </■),/,  i. 
where  it  is  to  be  understood  that  the  frequency 
variables  take  on  discrete  values  corresponding  to 
discrete  image  pixel  positions.  We  designate  the  imag¬ 
inary  part  of  the  bispectrum  as  B„  and  we  write  the 
Hilbert  transform  as  a  convolution  in  which  each 
column  of  the  image  (real  part  of  the  bispectrum)  is 
convolved  with  the  transform  coefficients: 

(» 

fit  /..A)  "  2  A  -  n  f  6lB.<n,  (,i  (34' 

a*  ) 

Note  that  we  have  also  included  the  effect  of  the  seven 
sample  filter  delay.  Strictly  speaking,  the  new  com¬ 
plex  image  still  differs  from  the  bispectrum  because  of 
the  phase  factor  that  is  responsible  for  the  fringes 
The  spatial  frequency  of  the  fringes  is  determined  by 
Fourier  transforming  the  isolated  interference  frame 
for  a  reference  test,  computing  the  magnitude  of  the 


Re[  G  F*  F* ...]  Im[  G  F*  P  ...] 
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Fig.  9.  Magnitude  and  phase  response  for  the  Hilbert  trans¬ 
former. 

new  function  (which  is  related  to  the  bicoirelation 
function),  and  locating  the  peaks  associated  with  the 
interference  fringes.  This  spatial  carrier  is  mixed  out 
of  the  complex  bispectrum  image  producing  a  base¬ 
band  image.  The  magnitude  and  (wrapped)  phase  of 
the  bispectrum  are  computed  from  the  standard 
formulas 

B  =  (S?  +  By  \  (35) 

fl. 

*  arcton  ^  ■  (36) 

o. 

The  two-dimensional  phase  is  unwrapped  by  first 
unwrapping  along  a  single  row  passing  through  the 
middle  of  the  image.  Each  column  is  then  unwrapped 
by  working  upward  and  downward  from  the  central 
row.  The  removal  of  the  spatial  carrier  suppresses  the 
fringe  information.  Otherwise  the  unwrapped  phase 
would  exhibit  a  slope  associated  with  the  fringes  that 
are  superimposed  upon  the  true  bispectrum  phase. 

In  this  section  we  have  shown  that  a  Hilbert 
transform  relation  exists  between  the  real  and  imagi¬ 
nary  parts  of  the  bispectrum,  given  reasonable  causal¬ 
ity  constraints.  We  have  further  described  an  ap¬ 
proach  for  recovering  the  magnitude  and  phase  of  the 
bispectrum  for  wide  bandwidth  waveforms.  This  pro¬ 
cess  includes  the  use  of  a  discrete  Hilbert  transform 
on  images  produced  by  the  optical  realization  of  the 


bispectrum.  The  method  is  quite  general  in  that  the 
only  requirements  of  the  ima^  are  that  it  be  suffi¬ 
ciently  sampled  and  that  the  bispectrum  information 
can  be  separated  from  other  components  of  the 
mterference  image.  The  performance  of  the  method 
for  several  test  cases  is  the  subject  of  the  next  section. 

V.  Test  Raauns 

The  Hilbert  transform  approach  to  bispectrum  mag¬ 
nitude  and  phase  recovery  was  tested  with  four  basic 
test  cases.  The  first  two  tests  emphasized  the  magni¬ 
tude  estimation  capability.  The  next  two  focused  on 
the  recovery  of  bispectrum  phase.  Four  different 
quadratic  transfer  functions  were  used  to  generate 
the  magnitude  and  phase  profiles.  For  each  case 
present^  here,  rf  signals  were  generated  as  described 
in  Section  III  and  introduced  into  the  optical  system 
through  the  AO  modulators.  The  output  images  were 
digitized  and  subsequently  processed  to  estimate 
magnitude  and  phase. 

A.  Bispectrum  Magnitude  Recovery 

The  spectral  envelopes  of  the  signals  used  in  the  first 
magnitude  test  are  shown  in  Fig.  6(b).  The  signal  f  (t ) 
is  a  series  of  sine  pubes  5  p.s  apart.  In  the  frequency 
domain,  the  spectrum  F(  /" )  b  a  set  of  uniform 
amplitude,  constant-phase  tones  200  kHz  apart,  with 
a  total  bandwidth  of  6  MHz.  The  signal  g(t)  is  the 
output  of  a  square-bw  device  followed  by  a  normaliz¬ 
ing  operation.  Its  spectrum  b  also  uniform  in  ampU- 
tude,  and  it  has  a  bandwidth  of  12  MHz.  F(f)  and 
Gif)  acre  quadraticaily  related  to  each  other  and  their 
cross  bbp^rum  should  be  nonzero  and  uniform  in 
amplitude  over  a  6  MHz  x  6  MHz  region  of  the 
bifrequency  plane. 

The  bolated  interference  term  (representing  the 
real  part  of  the  bispectrum  on  a  spatial  carrier)  for  the 
first  magnitude  test  is  shown  as  a  gray-scale  plot  in 


(MH2) 


Fig.  10.  Gray-scale  plot  of  the  isolated  interference  term  for  the 
first  magnitude  test. 
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Fig.  10.  The  fringes  do  appear  to  be  fairly  uniform.  A 
three-dimensional  plot  of  the  magnitude  of  the  bispec¬ 
trum  and  a  cross-sectional  plot  along  the  line  /",  =  3 
MHz  are  shown  in  Figs.  11  and  12,  respectively. 
These  reflect  the  essential  characteristics  of  the  out¬ 
put  we  expect.  Nonuniformity  of  the  amplitudes  of 
the  interfering  beams,  a  slight  nonuniformity  of  focus 
across  the  image  plane,  and  background  noise  contrib¬ 
ute  to  the  distortion  of  the  image. 

The  second  magnitude  test  employs  the  same  input 
signal  fit)  as  the  first,  but  the  signal  git)  is  the  direct 
output  of  the  square-law  device  without  normalixa- 
tion.  The  spectral  envelopes  of  these  signals  are 
shown  in  Fig.  6(a).  Now  Gif)  is  trian^ar  in  form, 
representing  the  convolution  otFif)  with  itself.  This 
shape  determines  the  expected  cross-bispectrum  out¬ 
put,  which  will  look  somewhat  like  a  tent  whose  peak 
Lies  on  the  diagonal  line  Z",  +  /i  =  6  MHz. 

Figure  13  shows  the  isolat^  interference  term  for 
the  second  magnitude  test  in  the  form  of  a  gray-scale 
plot.  Note  that  the  upper  right  tmd  lower  left  comers 
of  the  fringes  are  beginning  to  fade  out.  The  bispec¬ 
trum  magnitude  is  plotted  in  Fig.  14.  Cross-sectional 
plots  along  the  lines  f  =  and  f  +  /i  =  6  MHz  are 
shown  in  Figs.  15(a)  and  15(b),  respertively.  The  first 
cross  section  clearly  exhibits  the  triangular  magni¬ 
tude  profile  carried  hy  Gif ).  The  second,  along  the 


Fig  12  Cross  section  of  the  magnitude  of  the  bispectrum  from 
the  first  magnitude  test  (  f ,  =  3  MHz). 
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Fig.  13.  Gray-scaJe  plot  of  the  isolated  interference  term  for  the 
second  magnitude  test. 

peak  of  the  tent,  has  a  more  rectangular  form,  as 
expected. 

B.  Bispectrum  Phase  Recovery 
The  phase  test  cases  are  based  on  signals  with  the 
same  magnitude  spectra  as  those  of  the  first  magni¬ 
tude  test.  That  is,  Fi  f)  and  Gif)  are  6-  and  12-A^z 
wide,  respectively,  they  are  quadratically  related,  find 
they  have  uniform  amplitudes  in  the  frequency  do¬ 
main.  The  difference  is  that  Gif)  no  longer  has  a 
uniform  phase  profile.  Specifically,  its  phase  profile  is 
parabolic,  as  illustrated  in  Fig.  6c.  The  expected  form 
of  the  magnitude  of  the  bispectrum  should  generally 
be  the  same  as  that  of  the  first  magnitude  test.  The 
bispectrum  phase  (unwrapped)  should  be  a  parabolic 
surface  whose  axis  lies  along  the  line  fi  +  ^2  =  6  MHz. 

The  interference  frame  for  the  first  phase  test  is 
shown  in  Fig.  16.  Note  that  the  spatial  frequency  of 
the  fringes  changes  as  one  moves  along  the  diagonal 
of  the  plot.  The  unwrapped  phase  surface  for  this  test 
is  shown  in  Fig.  17.  Recall  that  prior  to  the  phase 
computation  the  spatial  carrier  was  mixed  out.  The 
domain  of  the  plot  is  limited  to  the  6  MHz  x  6  MHz 


Fig,  14.  Magnitude  surface  of  the  bispectrum  for  the  second 
magnitude  test 
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Pig.  15.  Cross  section  of  the  msignitude  of  the  bispectrum  from 
the  second  msignitude  test  along  (a)  /,  =  and  (b)  f,  +  /",  *  6  MHz. 


region,  where  the  bispectrum  is  nonzero.  This  results 
in  a  less  confusing  picture.  The  characteristic  para¬ 
bolic  profile  can  be  seen  in  Fig.  17  but  becomes  much 
more  obvious  in  Fig.  18 — a  cross-sectional  plot  of  the 
phase  surface  along  /",  =  /i-  The  equation  used  to 


Fig.  16  Gray-scale  plot  of  the  isolated  interference  term  for  the 
first  phase  test. 


Pig.  17.  Phase  surface  of  the  biq>ectniin  for  the  first  phase  test. 

generate  the  quadratic  phase  term  is  given  by 

4>,  »  +  /;  -  6  MHzlVlO  137) 

and  is  shown  as  a  dashed  curve.  The  solid  curve 
depicts  the  measured  phase  values.  Generally,  the 
agreement  between  the  two  curves  is  quite  good. 
There  is  a  slight  separation  of  the  curves  in  the  range 
from  0  to  4  MHz.  This  corresponds  to  the  lower  left 
comer  of  the  interference  pattern  in  Fig.  16.  Note 
that  the  interference  fringes  tend  to  fade  out  in  this 
region  of  higher  fringe  frequency.  Therefore,  the 
magnitude  of  the  bispectrum  is  reduced  in  the  region. 
The  agreement  between  the  measured  phase  vdues 
and  the  true  phase  profile  is  poorer  due  to  the  reduced 
signal-to-background  noise  ratio.  This  is  the  same 
effect  as  that  seen  with  the  loss  of  contrast  in  the  time 
delay  tests.^*  In  the  case  of  time  delay,  it  was  shown 
that  the  Gaussian  beam  profile  acts  as  a  wei^ting 
function  that  reduces  the  magnitude  of  the  correla¬ 
tion  between  signals  as  the  delay  is  increased.  Unlike 
the  effects  of  time  delay,  which  modify  the  fringe 
frequency  and  contrast  for  the  entire  6  MHz  x  6  MHz 
region,  the  quadratic  phase  profile  produces  a  position- 
dependent  change  in  fringe  frequency  and  omtrast. 

The  second  phase  test  employed  a  signal  whose 
spectrum  had  a  three-phase  step  profile  as  indicated 


lrequency(MHz) 


Fig.  18.  Cross  section  of  the  phase  of  the  bispectrum  from  the 
first  phase  teat  ( f,  « 


10  March  1992  /  Vol.  31,  No  8  /  APPLIED  OPTICS  1027 


u 


(MH*) 


Fig.  19.  Gray-scale  plot  of  the  isolated  interference  term  for  the 
second  phase  test. 

hy  Fig.  The  phase  difference  between  successive 
steps  was  120  deg  (2.094  rad).  The  effects  of  this 
phase  profile  are  clearly  manifested  in  Fig.  19,  which 
shows  the  interference  frame  for  the  test.  Note  that 
the  fringe  orientation  was  changed  in  this  test  sr  that 
the  fringes  met  the  phase  steps  at  rou^ly  a  tght 
angle.  This  makes  it  easier  for  the  reader  to  see  the 
effect  of  the  phase  transition  in  the  interference 
pattern.  It  also  illustrates  that  the  procedure  works 
regutiless  of  fringe  orientation.  The  phase  surface  for 
this  test  and  a  cros8>sectional  slice  are  shown  in  Figs. 
20  and  21,  respectively.  The  solid  curve  represents 
the  measured  phase  values  while  the  dashed  curve  is 
the  expected  phase  profile.  The  phase  values  shown  in 
Fig.  21  are  avera^  across  each  step.  The  phase 
differences  between  adjacent  steps  (for  this  particular 
slice)  aare  measured  as  112  deg  (1.95  rad)  and  107  d^ 
(1.86  rad).  Both  estimates  are  within  11%  of  the  true 
value  of  120  deg.  The  phase  steps  are  slightly  smaller 
and  the  transitions  less  sharp  than  we  might  expect. 


teat. 


Fig.  21.  CroM  section  of  the  (ritase  of  the  bi^Mctnim  from  the 
second  phase  teet  ( •• 


This  leads  us  to  suspect  that  our  interference  term 
isolation  procedure  has  filtered  out  more  of  the 
spectrum  of  the  image  than  we  intended  for  this 
particular  case. 

VI.  Summary 

The  primary  goal  of  the  present  research  effort  was  to 
compute  bispectrum  magnitude  and  phase  for  wide 
bandwidth  input  waveforms.  This  hu  been  accom¬ 
plished  through  enhancements  to  an  mdsting  optical 
architecture,  extension  of  the  input  sigi^  generation 
capebUity,  and  throu^  more  intensive  digital  process¬ 
ing  of  the  output  images.  The  improvements  to  the 
optica]  configimtion  provided  the  capability  of  satis¬ 
fying  the  frequency  sum  rule  over  an  extended  region 
of  the  bifrequency  plane.  A  new  signal  generation 
^rproacb  was  incorporated  that  is  capable  of  creating 
wide  bandwidth  test  signals,  ediich  are  related  by 
arbitrary  quadratic  transfer  fiinctions.  Both  develop¬ 
ments  are  necessary  for  the  implementation  and 
testing  of  a  wide  bandwidth  bispectrum  processor. 

The  digital  processing  of  the  interference  images 
that  contain  the  bi^)ectrum  information  todr  advan¬ 
tage  of  the  spectral  characteristics  of  the  images.  The 
interference  torm  containing  the  real  part  of  the 
bispectium  on  a  q»tial  carrier  was  isolate  from  the 
noninterference  badiground  terms  by  hii^-pass  fil¬ 
tering  the  image.  This  avmded  several  shortcomings 
associated  with  an  earlier  approach  in  which  multiple 
frames  must  be  acquired  a^  subtracted.  The  imagi¬ 
nary  part  of  the  bispectrum  was  recovered  by  perform¬ 
ing  a  Hilbert  transform  on  the  isolated  interference 
term.  This  is  possible  given  the  constraint  that  the 
signals  being  analyzed  are  causally  related  to  each 
other.  The  magnitude  and  phase  of  the  bispectrum 
were  comimtad  from  actiial  interference  imagra  for 
several  quadratic  transfer  functions.  The  magnitude 
tests  were  in  good  qualitative  agreement  with  expecta¬ 
tions.  The  agreement  of  the  pli^  t^ts  was  excellent. 

Future  testing  should  be  aimed  at  determinii^  the 
performance  limits  of  the  (qitical  realization  of  the 
bispectrum  in  terms  of  the  accuracy  of  time  delay”^ 
and  magnitude  and  phase  estimates.  The  ability  of 
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the  bispectrum  to  suppress  additive  Gaussian  noise 
should  be  investigated  in  the  context  of  the  optical 
processor.  The  kind  of  rigorous  testing  envisioned 
requires  that  the  processor  be  further  optimized  in 
several  different  areas.  The  quality  of  the  optical 
output  must  be  improved.  This  involves  both  the 
optics  and  the  rf  circuitry  used  to  drive  the  modula¬ 
tors.  The  digital  filters  and  Hilbert  transformer  used 
in  this  work  were  selected  primarily  for  convenience. 
Some  effort  will  be  necessary  to  determine  the  opti¬ 
mal  digital  filters  for  the  specific  processor  applica¬ 
tions  and  performance  requirements.  The  sp^  of 
operation  is  limited  at  present  by  the  image  acquisi¬ 
tion  and  processing  hardware.  A  reasonable  and 
realistic  goal  for  the  processor  in  the  near  future 
would  be  to  compute  the  bispectrum  at  a  rate  of  30 
frames/s  for  a  10  x  iO  MHz  region  of  the  bifrequency 
plane.  This  would  involve  algorithm  optimization  as 
well  as  hardware  enhancements.  In  the  future,  we 
might  consider  other  variations  of  the  implementa¬ 
tion  of  the  bispectrum  that  put  more  of  the  computa¬ 
tional  load  onto  the  optics.  Since  the  bispectrum 
information  exists  as  an  interference  term,  one  inter¬ 
esting  concept  would  be  to  record  this  as  a  hologram 
from  which  the  bispectrum  information  could  be 
reconstructed  as  an  amplitude  distribution  and  subse¬ 
quently  processed  optically.  Another  possibility  is  the 
use  of  an  optical  phase-modifying  element  that  would 
allow  simultaneous  recovery  of  the  real  and  imagi¬ 
nary  parts  of  the  bispectrum. 
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Subharmonic  growth  by  parametric  resonance 

By  M.  R.  HAJJ,  R.  W.  MIKSAD  and  E.  J.  POWERS 

(  'ollegt*  of  Enaioi-*-rin(4,  The  L'niver>it\  of  Texas  at  Austin.  Austin.  TX  78712.  I'SA 

(Ref-eivni  29  Xovemher  and  in  revised  fijrm  18  June  1991) 

An  experimental  investigation  is  conductetJ  in  order  to  quantify  the  nonlinear  and 
parametric  resonance  mechanism.s  that  arc  associatetl  with  the  .subharmonic  growth 
in  the  transition  to  turbulence  in  plane  mixing  layers.  Higher-order  digital  .statistical 
analysi.s  techniques  are  used  to  investigate  the  nonlinear  and  parametric  mechanisms 
resportsible  for  the  energy  transfer  ttj  the  subharmonic.  The  results  show  that  the 
dominant  interaction  is  a  parametric  re.sonan<  e  mechanism  iM'tww'u  the  fundamental 
and  the  subharmonic  modes  which  leads  to  a  pronounct'd  growth  of  the  subharmonic. 
Measurements  also  indicate  that  the  fundamental.  Iresides  intc-racting  with  the 
subharmonic,  is  also  engaged  in  redi.stributing  its  energy  ti>  the  other  Fourier 
components  of  the  How  via  nonlinear  three-wave  interaction.s.  Local  wavenumber 
measurements  vtuify  that  fre<juency  waxenumber  resonainx-  matching  conditions 
exist  between  the  fundamental  and  subharmonic  in  the  region  w  here  the  subharmoni*' 
gains  its  energy  by  parametric  resonanc<'.  The  results  are  in  general  agreement  with 
theoreti<al  models  In  Kelly  'llKiT).  and  Monkewitz  (198H)  on  subharmoni<' 
grow  th. 


1.  Introduction 

Two  important  features  of  the  transition  to  turbulence  in  plane  mixing  layers  are 
the  .'ubharmonic  grow  th  and  vortex  pairing.  The  emergence  and  subsequent  grow  th 
of  a  flow  comjjonent  at  the  subharmonic  frequency,  beyond  the  initial  linear 
instability  region,  is  a  i:ood  example  of  secondary  in.stability.  and  represents  the  first 
step  in  the  sequence  of  instabilities  that  lead  to  the  final  breaktlown  to  turbulence. 
Evidence  of  the  growth  of  the  subharmonic  mode  has  been  given  in  many 
experimental  investigations,  such  as  those  of  Sato  (1959).  Browand  (1966),  Miksad 
(1972)  and  many  others.  The  nu  chani.sms  of  vortex  roll-up  and  subsequent  pairing 
represent  the  vorticity  restructuring  in  the  transitioning  mixing  layer.  Evidence  of 
vortex  pairing  has  been  observed  in  the  flow  visualization  pictures  of  Winant  & 
Brnwaml  (1974)  and  Brown  &  Boshko  (1974).  Ho  &  Huang  (1982)  used 
hydrodynamic  instability-wave  concepts  and  flow  visualization  to  compare  the 
development  of  the  fundamental  and  subharmonic  modes  to  the  vortex  pairing 
mechanism  iji  mixing  layers.  Their  results  showed  that  the  saturation  of  the 
fundamental  mode  accompanies  the  vortex  roll-up  proce.ss.  and  that  the  saturation 
of  the  subharmonic  mode  accompanies  the  vortex  merging.  Also,  the  analysis  of 
Pierrehuinbert  &  M'iflnall  (1982)  showed  that  the  emergence  of  a  subharmonic 
component,  in  the  two-dimensional  case,  may  correspond  to  the  pairing  of  two 
neighbouring  vortices.  Ho  (1982)  sugge.sted  a  .simplified  model  in  which  the  mixing 
layer  grows  only  ly  the  vortex  pairing  mechanism.  In  general,  the  enhancement  or 
suppres-sion  of  the  spreading  mixing  layer  seems  to  be  dependent  on  the  ability  to 
control  thi.s  vorticity  restructuring  mechani.sm  and  consequently  the  .subharmonic 
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perforated  plates 


Fn.i  P.K  I.  Scheiiiatic  sketch  ut  the  tunnel  a|t|mrHt(is. 


trtindition  in  the  \va\ cmitnlHi'  tlntnain  for  (‘fficient  cxthanjjt'  via  i/araincfric 

resonance  is  established. 


2.  Experimental  set-up 

The  expel inunts  were  conducted  in  a  low  turhulence  subsonic  wind  tunnel  with  a 
30x20x  l.)<t  cm  test  section,  see  figure  1,  The  mixing  layer  was  fonnetl  by  merging 
two  laminar  streams  as  shown  in  figure  2.  'I'he  upiH'r  atul  lower  frw-stream  velocitie.s 
were  T,  =  7.17  and  /  j  =  1 .51  in/s.  resiK^etively.  This  resultetl  in  a  velocity 
differential  Af  =  5.fi6  tii/s  and  a  velocity  ratio  /f  =  +  f  —  0.65.  The 

initial  Reynolds  number  based  on  initial  momentum  thickness  and  upfx>r  fret*-stream 
velocity  was  305.  The  free-stream  turbulence  intensity  in  the  vicinity  of  the  trailing 
edge  of  the  splitter  jilate  was  0.(KM)5Ar  in  the  high-s|K-ed  stream,  and  0.001 1  Af '  in 
the  low-speed  stream,  .Most  of  th(>  background  furbulenre  was  concentrated  in 
frequencies  behtw  4<i  Hz.  Streamwise  velocity  fluctuations  wen*  measured  using  a 
DISA  56X/f’  hot-w  ire  anemometry  system.  .\  sj)ecial  [irobe  (Jones  ct  al.  1988)  with 
two  sensing  elements  was  use<l  in  these  experiments  to  measure  the  spectral  energy 
exchanges  between  tw  o  downstix-ain  locations,  figure  2.  The  two  .simsing  elements  are 
separated  in  ihe  streamwise  tlirection  by  a  distance  Ar  =  0.1  cm  and  in  the  spanwise 
direction  by  0.15  cm.  centre  to  (cntre.  The  cross-stream  st*paration  is  zero.  The  small 
spanwi.s(>  separation  is  used  to  isolate  the  downstream  wire  from  the  wake  of  the 
upstream  wire.  The  hot-wire  signals  were  DC  and  anti-alia.s  filtered  and  wen-  sampled 
with  a  (■.\.MAC  fligital  data  acquisition  system.  The  sampling  frequency  was  set  at 
HKX)  Hz.  The  results  shown  h(‘re  are  foi  natural  tran.sition  ex<  ited  by  random 
fluctuations  in  the  flow  .  Xof'xternal  foiling  was  u.sed.  The  measured  frequency  of  the 
dominant  instability  mode  of  this  transition  was /„  =  215  Hz. 


3.  Mean  flow  properties 

Th(>  fimc-avci aged  flow  properties  were  measured  at  midspan.  The  average 
velocity.  C  =  ^  is  equal  to  4.34  m/s.  The  initial  momentum  thickness  of  the 

high-sfR‘ed-side  free-stream  velocity.  (9„.  is  equal  to  0.064  cm  and  the  streamwi.se 
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wavflength  of  the  dominant  instability  mode.  A^.  is  et^ual  to  1.98  cm.  The  Strouhal 
number  (jf  the  dominant  instability  mode.  f^OjV.  is  equal  to  0.0317.  In  order  to 
compare  our  results  with  data  taken  in  ex{)eriments  at  different  values  of  R. 
Iength.scalc.s  are  normalized  with  the  ratio  R/A„  =  0.32  as  sugge.stcd  by  Huang  &  Ho 
(1990).  The  variations  in  the  cross-stream  direction  of  the  longitudinal  mean  velocity 
and  total  r.m.s.  fluctuations  at  different  downstream  locations  an*  shown  in  figure  3. 
The  mean  velocity  profiles  are  normalizc<l  according  t;.  the  ndation 
The  velocity  fluctuations  are  normalized  with  Af.  The  mean  flow  profiles  show  a 
r  ombination  of  a  wake  and  a  mixing  layer  up  to  Rx/A^  -  10.  Beyond  this  location, 
they  become  similar  to  a  hypt'rbolic  tangent  profile. 

The  vertical  spn-ading  of  the  mixing  layer  is  measured  by  the  local  momentum 
thickness,  defined  a.s 
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FliifRK  ('ro8s-.str»-ain  variations  of  the  normalized  mean  veloi-ity.  -(f  (;/)  —  f ')/Af '  B  )  and 
longitudinal  fluetuations.  u'/HC  (  ♦ — )  at  different  <lownstream  locations:  (n)  Hr/,\„  =  O.o.  (6) 

I.O;  (r)  1..-,;  (rf)  2,0:  (e)  2.7;  (/)  .i.O. 

Similarly,  an  energy  thiekness.  defined  as 


is  used  a.s  a  measure  of  the  flux  of  energy  from  the  mean  H()w.  The  downsf rt>am 
development  of  the  maximum  of  the  fundamental  and  subharmonic  modes  are 
shown  in  figure  4.  The  development  of  the  energy  thickness  and  the  momentum 
thickness  are  shown  in  figure  o. 

In  figure  4.  four  basic  regions  of  fundamental  and  subharmonic  development  are 
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mixing  layer  Htn'ainxvixe  dim-ti<>n. 

cvkk-nt.  For  /^.r/A„<  1.0.  the  funrlamental  and  auhharmonie  rntKle.s  grow  e.x- 
p<jnentiall\ .  The  non-tliniensioral  growth  rate  of  the  fiiiKiainental  and  the 
subharmoiiie.  based  on  0^.  an*  0.10  and  0.047.  respectively.  Thes«*  values  an*  within 
<)%  of  the  valne.s  predicted  by  the  spatial  linear  instability  analysis  of  Monkewitz  & 


391 


Siihhftrmonir  tjrouth  hy  jfMrti metric  rfHiniunr* 

Huerre  (1982)  Tht*  region  limited  by  Hx/A^  <  10  reprt‘«ent«  the  itutial  instability 
region  where  the  fluctuations  components  grow  exponentially,  and  will  lie  rcferrtsJ  to 
as  Region  I.  At  about  HxfA^=  I  <1.  the  subharmonic  starts  deviating  fn>m 
exponential  growth.  This  location  is  chosen  for  refenmce  beeauae  it  is  the  last 
location  at  which  the  subharmonic  amplitude  deviates  by  lea;'  than  o%  from  the 
amplitude  predicted  by  its  initial  exjioneritial  growth  Note  that  the  choice  of  this 
criterion  is  .subjective  and  is  stated  here  for  the  sake  of  comparison.  At  this  location, 
the  r.m.s.  amplitude  of  the  longitudinal  fluctuations  at  the  fundamental  frequency 
has  an  average  value  of  u'f^jC  =  0.012  {[leal:  value  =  0  016).  This  value  is  almost 
twice  the  critical  fundamental  amplitude  of  iK)lo3jt*  =  0.006  predicted  by 
Monkewitz  (1988). 

At  fix/Afy  =  1.6.  the  fundamental  achieves  a  saturation  level  of  0  ll3f'  This  level 
is  one  order  of  magnitude  larger  than  that  of  the  critical  fundamental  amfilitude 
Xotice  that  ufi  thi>  location,  figure  .>  uulicati's  that  the  vertical  spreading  of  the 
mixing  layer  and  the  c\tra<  tion  of  energy  from  the  mean  flow  an-  very  slow  Tin- 
region  fx'tween  Hx/A„=  1.0  anti  1.6  represt-nts  a  region  where  the  funtlann-ntal 
mode  continues  to  grow  at  a  rate  less  than  that  |>reilicttHl  by  the  linear  thetiry,  to 
eventually  etjuilibratc  into  finite-amplitude  oscillationH.  This  region  will  la-  referred 
to  as  Region  11. 

Bt'twtH-n  Hx/A,y  =  1.6  anti  3.2.  the  r  ms.  amplitiitle  of  the  longitutiinal  Hm  tuation.s 
at  the  fundamental  fretjuency  tlecrcast-s.  Huwevt-r.  the  subharmonic  (omjHuu-nt 
underg(M‘s  a  second  region  of  growth  and  achieves  a  .saturation  Icvt-l  of  the  .same  onlcr 
t)f  magnitude  as  the  funtlainental.  The  it-gion  liinitetl  by  Hx/A,,  =  1.6  and  3.2  will  fa- 
referred  to  as  the  secondary  instability  region  {Region  Ilf),  in  figure  6.  we  iiotiet-  that 
a  large  spreading  of  the  mixing  layer  also  ot  turs  over  this  region.  The  momentum 
thickness  doubles  betMOon  /ix/A,j=  1,6  and  3.2  indieating  that  the  extraction  of 
energy  from  the  mean  flow  is  increasing,  if  Kelly  s  (1967)  interat-tion  met-hanism  is 
eorret-t.  this  implies  that  the  energy  is  f«-ing  extracted  from  thi-  mean  flow  and 
transfern'd  to  the  subharmonic  by  a  parametric-  re.sonancc  meehanism.  Bi-yoncl 
lix/A„  =  3.2.  the  r.m.s.  amplitudes  of  thi-  longitudinal  fluetuations  at  the  fun¬ 
damental  and  .subhannonic  modes  start  decreasing.  This  eouhl  b<’  the  result  of  the 
second  subharmonii .  |/„.  or  the  onset  of  the  three-dimensional  activities.  This  region 
will  bi’  n-ferred  to  as  Region  l\’. 


4.  Fundamental-subharmotiic  interaction 

The  primary  aim  of  this  investigation  is  to  quantify  the  spectral  energy  exchanges 
arising  from  parametric  or  nonlinear  wave- wave  interactions  that  are  responsible  for 
the-  subharmonic-  growth  in  a  plane-  mixing  layer.  These  interactions  can  best  1m' 
studied  by  using  polyspectral  analysis  techniques.  The  role  of  these  interactions  in 
reinforc  ing  the  subharmonic-  can  be  quantified  by  measuring  quantities  such  as  the 
linear  and  nonlinear  tiansfoi  functicjii.s.  The  connection  l>etwt*en  existing  theoretical 
models  of  subharmonic-  growth  and  our  use  of  the  transfer  function  measuremcnt.s  is 
not  obvious  and  the  follow  ing  heuristic-  model  is  presented  to  motivate  this  approach. 

4.1.  Fundamental-subharmonk  energy  transfer 

As  shown  by  many  investigators  and  detailed  by  Mollo-Christen.son  (1971).  the 
turbulent  energy  transfer  from  the  mean  flow  to  the  primary  instability  c  omponents, 
and  from  the  primary  in.-<tability  to  the  secondary  instability  components,  and  vice 
versa,  occurs  bec  ause  of  nonlinear  terms  that  are  present  in  the  equations  governing 
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energy  exchange.  The  importance  of  these  terms  to  the  subfjartnontc  generation 
process  is  evident  in  the  equation  governing  energj-  exchanges  ^>etMeen  the  different 
components  of  the  velocity  fluctuations.  Stuart  (1980).  Monkew  itz  (1988),  based  on 
the  experimental  results  of  Ho  &  Huang  (1982)  and  the  discussion  of  Ho  &  Huerre 
(1984),  assumed  that  subharmonic  behaviour  in  mixing  layers  can  Ik*  considered  to 
be  locally  parallel  in  the  saturation  region  of  each  mode  in  the  subharmonic  sequent'C. 
Kelly  (1967)  made  a  similar  assumption  in  his  temporal  analysis.  Following 
Monkewitz  (1988)  and  Kelly  (1967).  the  effect  of  interacting  comprments  on  the 
growth  of  the  subharmonic  component  in  a  parallel  flow  can  be  written  as 


where  represents  the  subharmonic  component  of  the  .stream  function  and  ^(J^) 

and  represent  any  two  components  whose  interactions  could  lead  to  the 

subharmonic  reinforcement.  Note  that  the  summation  on  the  left-hand  side  is 
necessary  to  take  into  account  all  sum  and  differeiu-e  interaction.s  that  could  affect 
the  subharmonic  comfionent. 

The  analyse.*;  of  Kelly  (1967)  and  Monkewitz  ( 1988)  showtsl  that  the  manifestations 
of  the  parametrii  and  tionlinear  resonance  mechanisms  are  different.  In  the  nonlinear 
resonance  mechanism,  energy  is  passt'd  to  the  subharmonic  directly  from  the 
fundamental.  In  the  parametric  resonani'c  mechanism,  energy  is  luyt  rlirectly 
exchanged  with  the  fundamental,  but  passed  from  the  mean  flow  to  the  subharmonic 
as  a  result  of  the  parametric  variations  introdueed  by  the  fundamental.  These 
differences  lead  to  different  differential  equations.  While  the  nonlinear  resonance 
mechanisms  are  modelled  by  nonlinear  terms  in  the  governing  equations,  the 
parametric  re.sonance  mechanism  is  a  linear-like  phenomenon  in  that  it  is  modelled 
by  linear  terms  with  periodic  coefficients  resulting  from  the  periodic  distortions  of  the 
mean  flow  by  the  fundamental.  It  is  important  to  note  here  that  because  coupling  of 
the  subharmonic  with  the  fundamental  mode  is  either  directly  or  indirectly  involved 
in  both  types  of  mechanisms,  the  phase  coupling  and  wavenumber-frequency 
resonance  conditions  for  the  parametric  interaction  are  similar  to  those  required  for 
nonlinear  interactions. 

Itoh  (1977)  showed  that  by  decomposing  the  Fourier  components  of  the  stream 
function  into  a  sum  of  linear  eigenfunctions.  (3)  can  be  expressed  as  a  wave  coupling 
equation  of  the  form 

”•*  flt/ffm 

where  A(f)  is  the  amplitude  of  the  linear  eigenfunction  of  the  component  with 
frequency  /. 

This  couplinit  equation  describes  the  spectral  change  of  the  complex  amplitude  of 
the  frequency  component  /„  due  to  linear  and  quadratically  nonlinear  mechanisms, 
“(/m)  represents  the  linear  growth,  and  is  the  nonlinear  wave-wave  coupling 
coefficient.  The  first  step  in  determining  these  coefficients  and  hence  the  interaction 
physics,  is  to  express  (4)  in  terms  of  a  transfer  function  equation  that  models  these 
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pliysics.  The  ccK-ffii  ietit.s  of  such  an  equation  are  then  determined  by  measuring  the 
changes  in  the  complex  amplitudes,  *•!(/).  of  the  spectral  component  over  a  distance 
A.>-.  Equation  (4)  can  then  be  transformed  to  an  equation  that  relates  the  complex 
amplitude  of  the  wave  of  frequency /„  at  the  downstream  lo<  ation.  r-f  A.r  linearly 
to  the  complex  amplitude  of  the  same  wave  and  quadratieally  to  the  complex 
amplitudes  of  all  the  interacting  waves  of  frequencies /,  and  /,.  that  add  or  subtract 
to /^,  at  the  upstream  location,  x.  see  figure  2.  Such  a  relationship  is  established  by 
expressing  (4)  as 

y(fm)  =  uf„)xuj+  XI.  /„=/,±/,-  (5) 

In  this  equation.  )'(/„)  and  X{f„,)  are  the  complex  Fourier  amplitudes  at  frequency 
/„,  of  the  fluctuation>  at  two  points  in  space  in  the  transitioning  mixing  layer.  s«h- 
figure  2.  The  quantities  lAf^)  ^iTf  (juadratic  elements  of  the 

transfer  function  model.  res|M'ctively.  and  are  generally  complex  ipiantities  It  is 
inifiortant  to  note  here  some  of  the  characteristics  of  these  transfer  fun<  tions  First, 
the  quadratic  transfer  function,  unlike  the  linear  transfer  tum  tion  is  a  dimensional 
(juantity.  Secontl.  the  complex  am|ilitu<le  at  the  out|>ut  fr»-<juenc_\  eomjionent  is 
(ie|>en<lent  on  the  product  of  the  transfer  functions  and  the  complex  amplitudes  at 
the  input  frequencies.  Third,  the  jiarametrii-  reinfonement  of /,„  will  manifest  its<*lf 
as  a  [lart  of  the  dynamics  represented  by  the  linear  [xirtioii  of  the  transfer  function 
sime.  as  Nayfch  (1987)  notes,  the  parametrii-  a[)|iroach  to  the  fundamental 
subharmonic  interaction  leatls  to  linear  equations  with  |a-riodie  or  (juasi  jMTiixlie 
eoefiieients.  The  subharmonic  prolilem  can  thus  be  reduced  to  a  siniftle  transfer 
function  system  that  is  characterized  by  a  paralUd  combination  of  linear  and 
quadratieally  nonlinear  transfer  functions  {.sei*  figure  2).  Methods  to  estimati'  the 
linear  and  quadratic  transfer  functions  are  given  by  Kim  &  Powers  (I9H8)  and  Kitz 
&  Powers  (1986). 


4.2.  Wavenumber  domain  matching 

Instability -wave  theory  of  parallel  shear  flows  models  the  velocity  fluctuations  as 
travelling  waves  whose  amplitudes  may  vary  in  time  ami  space.  The  growth  of  the.se 
components  can  be  affected  by  the  linear  instability  mechanisms  and  by  (lararnetrii- 
and  nonlinear  interactions  of  the  different  components.  Cross- bisjH-etral  analysis  and 
transfer  function  modelling  can  be  used  to  study  the  characteristics  of  the  energy 
tran.sfer  due  to  these  interactions  between  the  different  modes  in  the  frequency 
domain.  The  spatial  characteristics  and  wavenumber  fre<iuency  matching  con¬ 
ditions  between  interacting  waves  play  a  particularly  imfxirtant  role  in  this  process. 
Bretherton  (1964).  for  example,  studied  wave  triad  resonance  and  amplif  mie  grow  th 
in  a  homogeneous  medium.  For  a  one-dimensional  nonlinear  eipiation.  he  showt'd 
that  wavenumber  resonance  as  well  as  frequency  resonam-e  is  important  for  two 
modes  to  interact  quadratieally  to  drive  a  third  mode.  In  the  case  of  mixing  layers. 
Monkewitz  (1988)  set  the  condition  that  the  fundamental  and  subharmonic  modes 
should  travel  at  the  same  phase  speed  for  an  efficient  transfer  of  energy  to  occur. 
Basically,  this  requires  wavenumber  as  well  as  frequency  resonance  between  the  two 
modes.  This  condition  was  also  assumed  in  the  parametric  res«)nance  model  of  K('liy 
(1967). 

Different  methods  of  extracting  .spatial  charaetcri.stic.s  from  time  series  data  have 
been  applied  by  many  investigators.  Stegen  &  \'an  Atta  (1970)  u.'^ed  the  signals  from 
two  points  separated  in  space  to  measure  phase  sf)ee<ls  <»f  the  Fourier  components  in 
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grid  turbuleni  Beal!,  Kim  &  Powers  (1982)  devised  a  teehnique  to  comjjute  the 
local  crierif.'  >|>ectrum  a.s  a  function  of  frequency  and  wavenumber,  from 
.simultaneous  measurements  of  the  time  .series  at  two  jxiints  in  space.  From  .such  a 
sfK'ctrurn.  one  i-an  then  compute  the  dis{)ersion  rtdationship  and  the  power  ,s[»ectrum 
in  the  wavenumber  domain.  In  the  case  of  tran.sitioning  flows,  the  mea.surement  of 
the  j)hase  difference  betw(*en  fluctuations  at  two  points  may  be  random,  which 
results  in  turbulent  broadening  of  the  di.sfK-rsion  relation.  Thus,  the  wavenumber 
value  a."^  a  function  of  time  can  be  treated  as  a  random  variable  with  a  probability 
distribution  about  a  mean  for  each  frequency  mode.  Wavenumber  measurements  in 
the  pre.sent  e.vperiments  were  made  using  a  tw<i-tcns<ir  probe  as  described  by  Jone*s 
el  ul.  (1988)  and  the  t<'chni(jue  d(‘vised  by  Beall  et  nl.  (1982).  Notice  that  the 
separation  between  the  two  stmsors  shoukl  fw'  sufficiently  small  compared  to  a 
wavelength  arui  coherence  length  to  avoid  spatial  aliasing  and  phase  incoherence. 
resiMu  tively.  For  a  tiefined  wavenumber,  we  have 

A'(/)  =  .4(/),e.\p{i(A'(/)r  +  <;4(/)i.  1 

'•  (6l 

>'(/)  =  d(/)^^,Yr‘‘XI>{i(A-(/UAj-  +  «i(/)l!  j 

Since  A'(/)  and  }  (/)  are  measured  simultaneou.sly.  the  pha.-x-  <^(f)  at  both  jatints  is 
the  .same.  The  sami»l<-  cross-(x)\ver  sjH'ctrum  is  then 

A*(/)  }•(/)  =  --l*(/)^.4(/),.^,e.vp{i(F(/)A.r)!.  (7) 

The  sample  local  w  avenumber  cati  then  b<-  i-elated  to  th<'  local  pha.se  of  the  sainjde 
<ros.s  p<nver  spectrum  and  is  given  by 

k{,f)  =  )>hase  SA*(/>  F(/)J/A.r.  (8) 

The  comfuitational  t«‘chnique  for  the  wavenumber  frt'quency  sjK'ctrum.  S{k.f).  the 
wavenumber  sfiectrum  S(k)  and  the  dis|H'r.sion  relationship  are  given  by  Beall  el  al. 
(1982).  In  this  investigation,  this  techniejue  is  used  tostmly  the  spatial  characteri.stics 
of  the  transition  to  turbulence  in  [dane  ini.xing  layers  and  in  particular  the 
parametric  resonance  mechanism  that  results  in  the  growth  of  the  subharnionic. 

S.  Results  and  discussion 

The  rcsult.s  })rest*nted  in  this  paper  are  ba.sed  <»n  measurements  of  the  stream w  i.se 
velocity  fluctuations  at  cross-stream  locations  corresponding  to  the  maximum  of  the 
fluctuations.  .Mik.sad  (1972)  presented  a  detailed  evolution  of  the  profiles  for 
the  various  instability  modes  involved  in  the  transition  process.  Estimates  of  the 
grow  th  rates  of  the  subharmonic  and  fundamental  modes  by  Mik.sad  (1972)  based  on 
maximum  differ  only  slightly  (10%  <»r  le.ss)  from  tho.se  determined  by  cross¬ 
stream  integration  f)f  total  .Measurements  matle  at  the  maximum  avoid 

problems  of  ri<ling  vip  or  dow  n  a  modal  profile  sIojk-.  a  problem  encotmtered  when 
measurements  are  made  at  a  eonstatit  cross-.stream  distame.  .Jones  (198.3)  showed 
that  the  cross-stream  location  of  maximum  also  elo.sely  coincides  with  the  cross¬ 
stream  location  of  maximum  "  hb-h.  acconling  to  bi.s|K*ctral  analysis  marks  the 
location  where  maximum  nonlinear  or  parametric  coupling  occurs  between 
interacting  mofh 

The  variation'  of  the  power  spectrum  of  the  streaiiuvise  fluctuations  in  the 
<iownstream  direction,  along  maximum  are  shown  in  figure  fi.  In  Region  I,  at 
/f.r/A„=1.0.  the  spectrum  exhibits  |H‘aks  at  the  fumlamental  frequency.  = 
21.7  Hz.  and  its  subharmonic.  The  large  energy  content  at  the  subharmonic  mode 
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Fioi're  6.  Variations  of  the  power  sj)ei'trum  of  the  streamwise  fluctuations  in  the  rlownstream 

direction. 

is  due  to  the  feedback  of  energy  at  this  frequency  from  the  vortex  pairing 
mechanism.  By =  1.6  (i.e.  Region  II).  both  the  fundamental  and  subharmonic 
modes  have  gained  energy.  Also,  the  band  of  fluctuations  centred  at  2/„  have  gained 
a  considerable  amount  of  energy.  The  low-frequency  components  have  also  gained 
some  energy.  By  Rx/A^  —  2.3  (i.e.  Region  III),  the  fundamental  has  lost  some  energy. 
However,  the  subharmonic  continues  to  grow  and  the  vallejs  between  these  modes 
and  the  other  harmonics  are  filling  rapidly. 

The  downstream  development  of  the  longitudinal  fluctuations  at  the  fundamental 
and  subharmonic  frequencies  are  shown  in  figure  4.  Ho  &  Huang  (1982)  showed  that 
the  saturation  of  the  fundamental  correspond.s  to  the  vortex  roll-up  location,  and  the 
saturation  of  the  subharmonic  corresponds  to  the  vortex  merging  location.  In  their 
experiment,  the  vortex  roll-up  occurred  at  R.r/Ag  =  2  and  the  vortex  merging  at 
Rx/Ag  —  4.  In  the  present  experiments,  see  figure  4,  the  fundamental  and 
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subharmonic  saturations  occur  at  Rx/Ag  =1.6  and  3.2,  respectively.  The  ba.sic  .states 
of  growth,  equilibration  and  decay  of  the  energy  of  the  longitudinal  fluctuations  at 
the  subharmonic  and  fundamental  modes  are  observed  and  agree  with  the  earlier 
results  of  Miksad  (1972)  and  Ho  &  Huang  (1982). 


5.1.  Detection  of  coupled  modes 

As  the  instability  modes  interact,  they  become  coupled.  The  characteristics  of  energy 
transfers  between  coupled  mode  components  cannot  be  determined  by  linear 
analysis.  For  example,  the  coherency  function  for  a  linear  system,  defined  a.s 

/(/)  =  E[UX(f)  T(/)P]/{£(1A'(/>|*]F[1T(/)1*]). 

where  denotes  an  expected  value,  can  only  describe  the  linear  relationship 

between  the  input  A'(/)  and  the  output  }'(/)  at  that  same  frequency.  It  cannot 
dc.scribc  the  interaction  of  two  different  frequency  components.  A'{/,)  ant!  at  the 
input,  to  reinforce  a  third  frequency  comfjonent  F(/„)  at  the  output,  such  that 
fm  —  /<i/r  As  noted  by  Powers  &  Miksad  (1987),  three-wave  interactions  can  only 
be  described  by  higher-order  statistical  moments,  such  as  the  cro.ss-bi.sfx'ctrum. 

The  cro.ss-bisiKH  trum.  defined  as 

//(/„/,)  =  E[y{/„,)X*{/,}X*{fj)]. 

where  /„  =  /,  ±/;  is  a  mea.sure  of  the  statistical  de|K'ndcnce  iH'twt'cn  the  output 
.spt'ctral  comfKmont  at  frequency /„,.  !'(/,„).  and  the  input  sjK'ctral  components  at 
frequencies and  fj.  A'(/,)  and  A'(/^),  where The  cro.ss  bi.s|a>ctrum  can  only 
be  non-zero  if  the  frequency  modes  at  /(./,.  and  are  statistically  dependent.  In  the 
cast^  of  three-wave  interaction  phenomena,  this  statistical  defKJndenc-e  arises  ber  aujM* 
of  the  phase  coherence  between  the  interacting  waves.  Thus,  the  product  of  the 
complex  amplitudes  of  the  three  components  will  have  a  non-zero  average  over  many 
realizations.  The  information  that  can  be  provided  by  the  cross-bi.spectrum  for  the 
problem  of  subharmonic  grow  th  is  of  particular  interest  because  it  [>rovides  a  means 
for  detecting  the  presence  of  coupling  between  the  fundamental  and  its  subharmonic-. 
A  quantitative  measure  of  this  coupling  is  given  by  the  cross  bicoherence  squared 

=  \mi.fj)\V{E[\X(/,)X{fjr]  E\\)y„,  )!*))(. 

It  is  important  to  note  that  the  estimation  of  cross- bicohercnce  dep«mds 
significantly  on  the  number  of  realizations  used  to  form  an  average.  Figure  7.  for 
example,  shows  that  for  one  realization  the  cross-bicoheronce  will  mcasuiv  |>erfect 
coupling  between  the  three  modes  under  consideration.  This  is  becau.se  in  this  ca.se 
there  is  no  averaging  and  the  numerator  and  denominator  are  equal.  As  the  number 
of  realizations  is  increa.sed.  the  levt-l  of  bic«>herencc  Imh-oiiics  low«>r.  Finally,  it 
converge.s  to  a  constant  value  of  0.6  after  256  realizations.  The  n.‘sults  [)resented  here 
show  cross-bicoherence  averaged  over  256  realizations,  each  realization  con.sisting  of 
128  samples. 

Plots  of  the  cross-bicoherence.  along  maximum  different  down.stream 

locations  are  .shown  in  figure  8.  These  locations  were  chosen  to  represent  the  different 
region.s  of  grow  th  of  the  fundamental  and  subharmonic  modes.  The  values  mca.sure«i 
at  y?.r/Afl  =  0.32  fall  in  the  primary  instability  region  or  Region  I.  \'alues  at 
Rx/A^  =  1.6  fall  in  Region  II  where  the  fundamental  starts  to  saturate  and  the  sub- 
harmonic  has  deviated  from  its  exjjonential  growth.  Values  .shown  for  Rx/A^  =  1.9 
and  2.3  fall  iti  the  .secondary  instability  region  or  Region  III.  The  results  show  that  in 
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n'ginn  I.  at  Rj/A„  =  the  stronjjfst  [K'ak  offura  at  {/«  —  l/o)  This  shows 

that  tht>  fundamonta!  mode  and  its  .subharrnonii-  :'.rr  phast-  oohm-iit .  This  early  phase 
eohorenee  is  th(‘  result  of  the  feedback  of  energy  from  the  dow  nstream  jMis'rtinn  where 
the  subharmonic  is  emerging  (vortex  |)airing  is  occurring),  as  suggested  by  Corke 
(1987),  Although  the  phase  coherency  implies  the  possibility  of  nonlinear  or 
parametric  interaction  and  transfer  of  energy,  we  are  still  in  the  region  of  exponential 
growth  where  bf)th  modes  arc  still  growing  at  the  linear  rate  predicted  by  the  linear 
instability  theory.  The  nonlinear  effects  are  still  minimal. 

In  Region  11.  at  Hx/A^  —  1 .9.  the  highe.st  level  of  bicoherence  ap}H‘ars  at  (/o-/®)  and 
(2/®.  — /„)  which  indicates  the  production  of  the  second  harmonic  via  the  .self- 
interaction  of  the  fundamental  mode,  as  evidenced  from  the  jmwer  spectra.  Also,  a 
new  bicoherence  pattern  emerges  w  here  the  contour  level  bands  extend,  away  from 
(/o  /o)  and  (2/®.  -/o).  to  include  fluctuations  at  C/’®-/®-^/)  and  {/®  +  <y/,/o)  in  the  sum 
region  and  (2/o±<5/,  — Z®)  and  {2f^±Sf, —f„±Sf}  in  the  difference  region.  The.se 
sideband  bicoherencies  indicate  that  the  fundamental  is  coupling  with  fluctuations  in 
its  skirts  to  broaden  the  second  harmonic.  Note  also  the  bicoherency  between  the 
fundamental  and  the  subharmonic.  (/,.  ±J/o).  and  the  fundamental  and  the  | 
harmonic.  (/®.  ±|/„).  However,  it  i.s  imjmrtaiit  to  note  that  the  levt'l  of  bicohereme 
between  the  fundamental  and  its  suhharmonic.  (/®.  — I/®),  is  lower  in  this  .Tgion  than 
in  Region  I.  Note  also  that  figure  4  show's  that,  in  Region  II,  the  .subharmonic  has 
deviated  from  e.\j>onential  growth  and  started  to  equilibrate.  Therefore,  it  appears 
that  this  equilibration  of  the  subharmonic  is  accompanied  by  a  reduction  in  the 
phase  coherence  Ijctween  itself  and  the  fundamental. 

At  the  beginning  of  Region  III.  Rx/Af^  =  1.9.  where  the  subharmonic  starts  its 
second  region  of  grow  th,  the  bicoherence  level  at  (/®.  —  |/®)  is  larger  than  that  in 
Region  II.  This  indicates  that  the  growth  of  the  subharmonic,  in  Region  III.  is 
accompanied  by  an  increase  in  the  level  of  its  phase  coupling  with  the  fundamental. 
The  above  bispectial  results  demonstrate  that  the  fundamental  and  suhharmonic 
modes  are  phase-coupled  over  Regions  I.  II.  and  III  of  the  transition.  In  Region  I. 
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downstream  locations  that  (•orit'S|)oiui  to  the  diffeivnt  regions  shown  in  figure  4.  Contour  levels 
are  set  at  0.3.  0.(>.  aiui  0,!).  («)  Hegion  I.  li.r/A„  =  0.32;  (h)  Region  II.  R.r/A^  =  I.H:  (r)  Region  III. 
l{.r//\„  =  1.0;  ('/)  Remotl  I\'.  /i’.r/3„  =  2.3. 

the  eoiipling  i.s  a  re.sult  <if  the  feedback  of  energy  at  the  fundamental  and 
subhaitnotiic  fretmencies  from  the  vortcN  pairing  (Corke  1987).  In  Region  II,  the 
level  of  hicoherence  Iteeome.s  lower  as  the  fundamental  starts  to  equilibrate  and  the 
growth  rate  of  the  .siibharimmie  mode  deviates  from  its  initial  exponential  growth 
rate.  As  the  stdiharmonie  starts  its  seeontl  stage  of  growth  in  Region  III.  the  level  of 
bicohcoenee  increa.ses  again.  The  existence  of  phase  coupling  between  the 
fundamental  inorie.  /„.  and  its  subharmonic.  |/„.  provides  a  Hr.st  .step  in  establishing 
evidence  of  a  nonlinear  coupling  or  parametric  resonance  between  the  fundamental 
and  its  subharmonic. 
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5.2.  Linear  ami  quadratic  energy  transfer  analysis 
The  cross^- bicoherence  analysi.s  providc.s  experimental  evidence  of  the  coupling 
between  the  fluctuaticjus  at  the  fundamental  and  subharmonic  modes.  However. 
crosS'bicoherence  measurement.s  alone  cannot  provide  information  on  the  nature  and 
efficiency  of  the  coupling  and  its  role  in  transferring  energy  to  the  subharmonic.  It 
also  cannot  determine  whether  the  measured  three-wave  coui>lings  are  due  to 
interactions  occurring  at  the  location  of  measurement  or  are  due  to  earlier  upstream 
interactions  which  are  advected  past  the  probe  by  the  flow.  These*  cjuestions  can.  to 
some  e.xtent.  be  answered  by  measuring  the  linear  and  quadratic  tran.sfer  functions 
as  shown  in  (5).  see  also  figure  2.  If  we  denote  the  output  of  the  linear  and  quadratic- 
transfer  functions  at  frec^uency by  \\  and  1^.  resfiectivcK .  the*  model  ejutput  at 
frequency  /„,  is  then  given  by 

>'(A.)  =  >!(/„,)+  ><,(/,„)• 

Ritz  &  Powers  (198b)  useel  an  iterative*  a|){)roae‘h  in  the  discrt*te*  frt'cjuency  domain 
te)  estimate  the*  line*ar  and  ejuadratic  transfer  fun<*tions.  A  mem*  gene*ral  approae  h  by 
Kim  &  Powt*rs  ( 1988)  that  dot's  not  ree|uire  iteration  was  use*d  in  our  e*stimat(*s  of  the 
linear  and  (juadratie-  transfe*r  fune*tions.  In  this  a|)proach.  the*  transfe*r  functions  an* 
dete*nnine*d  by  seelving  two  moment  e*<|uatie)ns ; 

and 

A’!  }•(/„, )A*(/,)A'*(/,)l  =  A(/,J  A’|A'(/„,)A'*(/,)A-*(/,)l 

+  SS  (/f:f,K\XU))X{fj)X*{f,)X*{fM.  (11) 

where*/,,,  =  f(+fj  =  A  ±/c  Note  that  tht*sc*  two  c'CjUations  ix*rmit  one  te)  express  the 
linear  transfer  fimctie)n.  L(f„),  and  the  quadratic  transfin*  function,  in  terms  eef 
the  various  poiysfx'ctral  moments  that  can  be*  caleulatc'd  frotn  the  Fourier 
Transforms  of  the  input  and  e)ut|)ut  signals.  re*s|)eetively  given  by  the  upstream  and 
deiwnstream  .sensors  of  the  probe.  For  example,  the  twe)  term.s  on  the  left-hand  .sides 
of  (10)  and  (11)  are  the  cross-powc’r  .spectrum  =  A’| 

c-ross-bispcctrum  /,)  =  £,’[}'(/,„) A’*(A)A'*(A)|.  The  terms  on  the  right-hand 

sides  are  a  hierarchy  of  sjM’ctral  mom<*n(s.  s{x*cifically.  the  auto-power  sf)ectrum 
*'^’r,r(.L)  =  ^-''IA'{/,„)A'*(/,J1.  the  auto-bis}H*etrum  =  K\X{fJX*{f^,)X*{f,)] 

and  a  fourth-order  sj)ectral  moment  of  the  input  A'IA'(A)-^'(A)-‘^*^A)*^*(A)J- 

f)nce  the  linear  and  cjuadratic  transfer  fmictions  are  determined,  one  cati  predict 
the  output  sjH'ctrum.  as 

=  A’l  }•(/;„)  y*{fj]  =  myLUn,)+yQifm))Olifj+  )'^,(/„))*] 

=  >l(A«>l+*fe’li'o(.A,)  >■$(/„.)]+-’  Rt'A’l !',(/„,)  }’*(/,„)] 


(12) 
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f,±frfmfk±fl-fm 

and  6'^^(/„)  =  2Re[  2  L(UQff’^lE[X(fjX*(UX*  [/,)]]. 

/i  ±/>"/ m 

The  ■  predicted '  output  power  spectrum  is  then  the  sum  of  the  linear  power  spectrum, 
Sj^{f„),  the  quadratic’  power  spectrum.  SQ{f„).  and  a  linear-quadratic  power 
spectrum.  It  is  important  to  note  here  that,  in  the  set-up  of  the  probe  used 

in  these  experiments,  the  output  powt.  ^yyUm)  includes  the  input  power 
which  is  advected  b\’  the  mean  flow  i  ,n  the  first  sensor  to  the  second  sensor. 
Because  our  intere.st  is  in  the  local  characteristics  of  the  linear  and  nonlinear  growth 
mechanisms,  we  subtract  the  input  power  from  the  output  power  and  consider  only 
the  change  in  the  fKiwer  spectrum,  A.S(/).  between  the  two  streamwise  locations. 

where  =  [|/dAi)P- In  this  equation,  the  change  in  the  energy 

content  of  the  inwle  with  frequency /„  i.s  dividetl  into  three  parts,  namely  the  linear, 
quadratic,  and  linear-quadratic  change.s. 

A  flower  tran.sfer  rate.  can  then  be  defined  as  the  ratio  of  the  estimated 

change  in  the  energy  of  the  mode  with  frequency /„  and  the  actual  input  energy  of 
that  mode  over  the  distance  Ax.  Ex).  The  power  transfer  rate  can 

then  be  divided  into  three  parts,  namely  the  linear  power  transfer  rate,  rf(/„)  = 
[l/v(/„)p— 1],  the  quadratic  power  transfer  rate.  and  the  linear-quadratic 

power  transfer  rate.  These  are  given  as  the  ratiosof  .Sq(/„)  and  Stol/ml 

over  resjiectivcly.  For  a  perfect  .system  with  no  errors,  all  the  energy  change 

is  accounted  for  by  the  three  parts  and  the  power  transfer  rate  is  equal  to  one.  Also, 
as  noted  in  §4.  the  linear  transfer  component.  rf{/„).  includes  the  effect  of  the  linear 
growth  and  the  energy  transfer  via  the  parametric  mechanism  to  the  mode  of 
frequency/^.  The  transfer  of  energy  via  nonlinear  resonance  of  two  input  modes /, 
and  fj  to  an  output  mode  at  /„  is  included  in  the  quadratic  power  transfer 
component, 

5.2.1.  Quadratic  energy  transfer 

As  noted  in  §4.  the  nonlinear  transfer  of  energy  from  two  input  modes /,  and  to 
an  output  mode  at  depends  on  the  quadratic  transfer  function  This  quantity 
i.s  a  comfilex  function  with  amplitude  and  phase.  From  (5).  it  can  Ik*  shown  that  the 
quadratic  transfer  function  is  given  by 

Q^,^,  =^{y{fj~Uf„)X[f„,))/X{f,)X{f,).  (14) 

Note  that  the  quadratic  transfer  function  is  inversely  dependent  on  the  amplitude.s 
of  input  modes.  A'( /,)  and  X{fj}.  Figure  9  shows  the  logarithmic  contour  plots  of  the 
magnitude  of  the  quadratic  transfer  functions.  measuretJ  at  locations  that 

correspond  to  the  four  regions  defined  above.  Note  that  the  overall  details  of 
plots  change  gradually  in  the  down.stream  direction.  For  instance,  the  magnitude  of 
the  quadratic  transfer  function  i.s  very  low  at  (/o,  —J/o).  at  each  downstream 
location.  From  (14).  it  is  evident  that  this  is  in  part  due  to  the  fact  that  the  energy 
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Fiocke  9.  Logarithmil'  rontour  plots  of  the  magnitude  of  the  quadratic  transfer  functions. 
of  the  longitudinal  fluctuations  at  different  downstream  locations  that  corresjxjiid  to  the  different 
regions  shown  in  figure  4.  f’ontour  levels  are  set  at  0.5.  1.5.  and  2.n.  (n)  Region  T.  ffr/A,  =  0.32;  (b) 
Region  If.  =1.0.  f')  Region  111.  =  2  6;  (d)  Region  IV.  Bx/A„  =  3.2. 


of  the  input  modes  at  /„  and  is  large.  However,  it  is  important  to  note  that, 
although  the  large  energy  levels  at/,.  and  |/o  are  comparable,  the  magnitude  of 
the  quadratic  transfer  function  is  larger  at  (|/„,  — /,)  and  (2f^.  — /#)  than  at  (/„,  — |/o). 
This  suggests  that  the  efficiency  of  the  quadratic  energy  transfer  via  nonlinear 
resonance  is  larger  for  the  (f/,,.  —^)  and  (2/o.  -/<,)  interactions  than  for  the  (/,,  —  |/o) 
li.e.  fundamental — .subharmonic)  interaction.  The  sensitivity  of  the  quadratic 
transfer  function  to  the  energy  level  of  the  input  modes  is  also  apparent  from  the  fact 
that,  at  all  dow  nstream  locations,  measured  values  of  the  quadratic  transfer  function 
magnitude  are  typically  large  for  pairs  of  frequency  components  that  have  low  input 
energy.  This  effect  stems  from  the  fact  that  the  quadratic  transfer  function  is 


402 


J/.  ti.  Hnjj.  H.  H'.  Miknud  and  E.  •/.  PoiruM 


Fioikk  l(f.  <  ‘<iiiti>iir!«  plots  ofthc  int*“ruction  |Kitciitiai.  l.V(/,)A'(/y)r’.  of  the  longitilditiAl  Hiiitiiations 
at  fliffereiit  doM  nstreain  locations  that  corirsjKind  to  the  different  I'ejtions  shown  in  tiffure  4. 
fontour  levels  ai-e  set  at  0.2.  O.o.  and  <1.8.  («)  Region  f.  Hx/A^  ~  <1.32:  (//)  Refiion  IT.  fix/A„  =  l.tt: 
(r(  Region  ITT.  Hx/A,,  =  2.H:  id)  Rejiion  TV.  HxlA„  =  3.2. 


(limonsioiiul.  as  evidnit  bv  ( 14).  and  is  inversely  de{)endent  on  the  eiierj?y  level  of  the 
inteiHcting  modes. 

Miksad.  Hajj  &  Powers  (lO.SO)  proposetl  that  a  gtaal  measure  of  the  amplitutle 
de|H-ndeiKe  of  is  given  by  the  tnagnitiide  of  the  coupled  input  energy  level. 
1A'{/,)  A’(/,)|*.  which  will  be  referred  to  as  the  interaction  |M>tentiul  ’.  This  quantity  is 
a  measure  of  the  potential  amount  of  energy  that  interacting  mixles  have  fttr 
nonlinear  energy  transfei .  Low-energy  modes  have  less  energy  to  transfer  to  a  third 
nuslethan  high-energy  nuKh-sdo.  Figure  10  shows  the  frequen<*y  domain  distributio<i 
of  the  interaction  jwtential  at  four  »lownstr<'am  locations  that  corivspond  to  th<* 
different  regions  of  the  transition,  as  d«-fined  in  t)3.  Note  that  the  upjx'r  and  lower 
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///„  ///,. 

Foil  HK  1 1 .  Contdiirs  nftln-  «*-ighfcH  iioti-diini-iisioiinl  (|tmilniti<-  trati.sfiT  fum-tions.  V''(/,./,)  nftlu' 
loniritudiiirtl  liuctiiatinn'^  al'in}:  max  tindi-r  natural  <  \ritati<>ii  at  ditVfii'nl  duwnstrcam 
liii  ation.H  that  cur tt-sjaind  tu  the  difl'erent  ri')iiiiiis  -.huvvii  in  ti;'ut«-  +  I'untuttr  leM'Is  are  .si-t  at  tt  :t. 
I)  (i.  and  ().!).  (nj  Heejun  I.  =  0.H2:  {h)  |{ej;iiin  H.  f!>'/A„  =  l.ti:  (<•)  Kej'iun  III.  =  2.<>: 

(d)  He>rii>n  IV.  Hj ;  X„  =  .'1.2. 

triangular  reginns  arc  syininctrical.  sinct*  A'(— /)  =  K*(f)  anti  tht'iclurc  |A'(— /)p  = 
lA'I/)!-.  Note  that  in  Ixcgittn  II.  most  of  the  intt*raction  potential  is  centred  tit 
(/„./„)  intlicating  a  significant  iiiftut  potential  for  the  transfer  of  energy  to  the  first 
haiinonie.  The  actual  energy  transfer  th<‘  fir.st  harmonic  can  he  clearly  seen  in  the 
power  .speetta  and  hieoherenee  plots.  The  large  interaction  i»otenfial  at  (/„.  — 1/„) 
indicates  that  signitieatit  energy  transfer  to  the  suhharmoni*-  can  also  occur  in  these 
regions. 

A  nu’asure  of  nonlinear  energy  transfer  can  he  ohtaimal  hy  weighting  the 
dimensional  (piadratic  transfer  function,  (/f'l'f  hy  tlie  energy  potential  of  the 
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intfra(  tiny  in|iUt  inodrs.  !A'(/,)  A’{/^)j-.  to  obtain  a  non  dinu'iiMional  (juadratn  franstbr 
function  such  a- 

QV,h)  =  (!A'(/,) AixrjSl.  /„  =/,  ±j,  ( i:,) 

Contour  plots  of  the  weighted  non-dirnen.sional  transfer  functions,  along  rnaxinuiin 
are  show  n  in  figure  11.  Pairs  that  show  high  4*  *(/,•/>)  characterize  tho.s<* 
intera<  tions  w  hich  are  potentially  most  efficient  in  transferring  energy  Note  that  at 
Rxi  =  0.32.  there  are  no  indications  of  any  prospect  for  encrg\  transfer  ta‘twc«-ii 
any  conifionents  This  location  corre.sponds  t<»  Region  I  the  region  of  primary 
instability  These  results  agn-e  w  ith  the  predictions  of  the  linear  stability  theory  that 
all  mofles  grow  indepentlently  in  the  early  region  of  the  transition  .More  im|>ortant 
to  the  (piestion  of  siibharnionic  i;eneration  is  the-  fact  that  although  the  i-rosv 
bicoherency  plots  indicate  nonlinear  coui)ling  .starling  at  /(’.j /,l„  =  n  32  th<'  tV  i* 
ineasurements  indii-ate  that  tin-  pros)H'<-t  for  (juadrutic  energy  tiaiisfir  to  the 
subharinonii-  in  this  region  is  not  significant.  'I’his  demonstrates  that  the  growth  of 
the  subliarmonic  in  Region  1  is  not  due  to  a  nonlinear  interaction  nuchanism  Tlu' 
first  involvement  of  the  subhannonic  in  nonlinear  interaction  ap|M’ars  in  the  sum 
region  at  P.r/,\„=  1.0.  at  the  end  of  Higion  II.  Recall  that  b\  this  loiafion  the 
fundamental  has  starteil  to  etpiilihrate  and  the  subhannonic  growth  rate  lias 
deviated  from  its  initial  exponential  growth.  However  the  invohcinent  of  flu 
stibhannonic  component  in  nonlinear  .energ\-  transfe-r  does  not  become  significant 
until  =  2. <5.  where  the  contour  levels  of  iy'l"  at  (/„.  and  (|/„.  -/„)  arc 

larger  than  at  earlier  locations.  This  indicates  that,  up  to  this  ioiuti(»n.  the  pros|H  cl 
for  (|uadratie  energy  transfer  to  the  subhannonic  is  ver\  low  ,  Kurt  her  ilow  nstn-am 
at  lix/A„  =  3.2  in  Region  III.  the  subhannonic.  the  fuiulamental  and  their  harmonics 
have  enteri-d  into  a  wide  variety  of  interactions  with  other  fluctuations  and  the 
contours  indicate  the  initiation  of  a  redistribution  of  subhannonic  and  fundamental 
modi'  en<‘rg_\  to  the  differi'iit  comjioiu'nts  of  the  flow. 

.1.2.2,  Lhtfur  (tnd  fnimiiii'lt  if  t  nf'iyi/  Irniisft  r 

'I'he  afiovi'  measurements  of  the  (piadratic  transfer  fuiK'tion  and  its  weighted  lum- 
tlimensional  function  show  an  important  ix-sult.  Roth  the  (piadratii-  transfer 
function.  \Q\^.  and  its  wcighterl  form.  show  large  magnituiles  at  (2/„.  — This 
indicates  efficient  transfer  of  energy  la-tween  thesr*  modes.  On  the  othi’i  hand,  the 
level  of  the  magnitvules  of  these  functions  at  the  fumlamenlal  subhannonic.  (/„. 
—  if,,),  imlicate  that  the  [irosjiei-ts  for  ((uadratii’  transfer  of  energy  due  to  nonlinear 
resonance  from  the  fundamental  to  the  subhannonic  an'  low.  'I’he  i|Uestion  that 
arises  is  then.  What  is  the  role  of  the  linear  transfer  function  that  includes  the 
parametric  effects  on  the  growth  of  the  subharmonic  '  This  (piestion  can  Im'  answered 
by  examining  the  linear  transfer  function  and  its  role  in  the  transfer  of CiH-rgy  to  the 
subharmonic  and  l»y  comparing  it  to  the  corresponding  role  of  the  (|uadratic  transfer 
function. 

The  grow  th  rate  of  the  fundamental,  siibharnionie  and  f  harmonii  due  to  linear 
and  linear-like  effects,  at  dififerent  dow  nstn’am  locations,  are  given  liy  the  plot  of  the 
linear  power  transfer  rate,  Tf{f„,).  as  shown  in  figure  12.  Note  that  in  Region  I.  the 
linear  power  transfer  rati's  of  the  th'—e  modes  are  eon.stant  and  erpial  to  the  growth 
rates  as  predicted  by  the  linear  instability  theory  of  Monkeuitz  &  Huern'  (19821.  In 
Region  II.  between  /fj'//\„=:  1,0  and  R.r/A„=  l.fi.  the  level  of  the  linear  power 
transfer  rate.  rff,^,).  of  these  modes  starts  to  decrease.  'Phis  decrease  in  the  grow  th 
ol  the-e  modes  i'  exfieeted  because  of  the  spreading  of  the  mixing  layer.  Rexond 
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Fiiii  RK  12.  Ijiiphi'  [iiiwpf  transfer  rate.  r/(/  )  i>)  the  fnn<laiiiental. /„  (  Q  ).  the  suhharinonie 
j/n(  X  )  «'''l  the  ?  haniKiiiH’  if  {  #  )  i  *.  (l•lUl)slreatn  tlistaine 

Kv'Siittti  ll.  tm\y  thf  liwear  power  tv.\ti>Vev  of  ti\e  sttUh»»rTttt»tie  niei  ^  harniotiit  arc 
showti.  The  i'uiulaiiu'iital  j^rowth  is  m-jrative  and  thfirfore  is  not  shown.  In  I^Kion 
III.  Iietwecn  li->'/A„  —  l.h  and  Jir/A,,  =  2.<i.  the  linear  |M»vver  transfer  rate.  Tf{f,„).  of 
tlie  ^  harmonic  continues  to  decrease.  In  contra.st.  the  linear  [tower  tiansfer  rate  of 
the  sultharmonic  increases  Only  a  iinear-like  mechanism,  such  as  the  [tarametrit 
mechiinism.  may  e.\|j|aiti  the  increa.se  in  th<‘  linear  [tower  transfer  rate  of  the 
suhharinonie.  1  hese  ii’.sults  indictite  that  the  ffrowth  of  the  .stdiiiannonic  in  Hejiion 
III  is  iU('om[)anied  by  an  increase  in  the  lintsir  transfer  of  [atwer  as  a  n-sult  of  the 
]iara metric  resonance  mechanism. 

The  total  energy  change  of  the  suhharinonie.  A.Sfi/J  at  different  dovxnstieam 
locations  is  shown  in  figure  13(u).  Also  shown  on  the  .same  [tiol  are  the  [tortious  of 
ANi/',,)  that  are  due  to  linear  and  qu  idratie  effects.  iV,  (|/o)  and  <‘'^(5/,,).  res[i<'et ively, 
SiiKc  the  parametric  resonance  mechanism  is  a  linear-liki'  mechani.sm.  its  activitv 
will  a[)[X'ar  in  the  linear  terir  tif  (ii).  Tlie  transfer  <tf  energy  to  the  suhharinonie  due 
to  nonlinear  interai  tions.  is  twit  tit  three  orders  of  magnitude  less  than  the 

energx-  transfer  hy  [larametric  linear-like  meehani.snis.  Although  hoth 

[laraiuetrie  and  nonlinear  nu'chanisms  are  simultaneously  influencing  the  suit 
harmonic',  the  results  dearly  show  that  the  cliange  in  the  [tower  of  the  sultharmonic 
mode  is  mainly  due  to  a  [larametric  resitnanie  and  only  small  noidinear  effects  are 
[tre.sent. 

The  nature  of  the  energy  transfer  to  the  |  harmonic  in  Hegion  III  stands  in 
contra.st  to  that  of  the  sultharmonic.  Figure  f:i(b)  .shows  the  total  out[iut  eni'rgv 
(hange  of  the  |  harmonic.  A-S'(?/,,)  at  d’fferent  downstream  locations.  I’he 
contrihutions  of  and  to  the  itverall  energy  change'.  A.S'(|/„).  are  ulsit 

shown  on  the  same  figure.  Note  that  iit  Hegion  I  the  linear  energy  transfer  to  */„. 
■''/.(l/ijl  dominant.  However,  as  sve  mctxe  further  downstn'a.n  to  Hegion  III. 
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Fku  bk  t!?.  ((/)  Total  jMiWHi'  clums'i'.  and  linear  and  <|uadrati<-  |Mixvei  tr.ui-li  t  ■'■;(*./„)  and 

.Vy(i/„).  rexiK'ctivelv.  >•«  flow n.'treain  di^tanre:  and  Total  |«nv<’r  rhani'e  A>{3/„)  and  liin-.n  and 
((uadratic  ])owi*r  transfer  and  ifs|H'<  tivel\ ,  ;•«.  dii\» nstrcaiii  distains'  (  3  .  A.'' 

■  n  .  N,  ;  ♦  .  f'  I  Rat ii)  of  ((iiadratic  to  linear  |>o«ei  transfer  rati'  of  the  siihhai  nionn  l./„ 

(-  ♦  ■).  and  the  ^  harnionii-  \f„  (  3  )  is  downstreain  distaine. 


in  Kogion  HI.  nonlinear  interactions  art*  |triniarily  res|ionsi()le  for  tlie  jrrowfli  of  ^/„ 
anti  that  parametric  effects  are  [triinarily  rt'sponsihie  for  snitharinonic  iriow  th. 

Wnn-iiitmlu-r  fri'qnniry  niairhing 

The  spatial  characteristics  of  the  transitioning  mixing'  layer  are  ohtiiincd  nsiii)'  the 
techni(|iic  described  in  §4.  These  characteristics  are  seen  in  the  estinnites  of  the  local 
wavenumber  fretpiency  spectra  S{k.f)  that  arc  shown  i*  hjrure  14.  Krom  t!;e;;<  jdots 
we  can  detect  some  features  of  the  disjtersion  ndationshijn  At  /i  v/A„  =  onls  the 

fundamental  anti  subharmonic  modes  have  a  definite  dis|M'rsion  relationship.  By 
Iix/A„  =  1.6.  we  notite  the  (feneration  of  the  second  harmonic.  Al.so  notice  that  the 
harmonic  bands,  at  /„.  5/0  and  2/„  are  confine<I  to  a  relatively  narrow  ran^rc  of 
wavenumbers.  By  Kx/A„  =  2.3.  we  notice  the  broadening  of  the  s|H‘i  trum  alonjf  the 
harmonic  components  and  the  valleys. 

In  order  to  ftain  bett<‘r  insight  into  tln'  sjiatial  charac-teristii-s  of  the  difli'n’Ut 
comfionents  of  the  How.  plots  of  the  local  wavenumber  spectra  are  show  n  ir  figure  1  ">. 
In  the  primary  instalnlity  region  or  Kegion  1.  at  lix/A„  =  1.0.  most  of  the  eiu-rgy  is 
contained  in  a  baiul  of  wa\enumbers  <'entre<l  at  />„  =  3.4  ra«l/cm.  which  corrcsp<in<ls 
to  a  wavelength  of  l.«3cm.  This  value  is  very  close  to  the  wavelength  of  the 
fundamental  instability.  =  1.98  cm.  By  Rx/Af,  —  2M.  or  in  Region  II.  different 
wavenumber  components  have  gained  some  energy,  leading  t<i  broadening  cjf  the 
spectrum.  In  Region  III.  between  RxfA^  =  2.6  and  3.2.  the  wavenumbei  spt'ctra 
shov<  that  most  of  the  energy  is  contained  in  a  band  of  wavenumbers  centred  at 
i  =  1.7  rad/cm  (=  \k„).  w  hich  is  half  the  value  at  w  hich  the  initial  jjeak  is  located. 
Therefore,  as  the  maximum  energy  in  the  frequency  spectrum,  in  figure  6.  shift.s  from 
/fl  2/0  'vavenumber  spectra,  in  figure  lo.  shifts  from  to  This 
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Fl<;i  KK  14.  \Vav**imml)*'r”frequ«*iu’y  «p*-<*tni.  of  the  iongitu<iiiial  fliictuationR  at  difrcrpnt 

downstream  locations  that  corresp<»nd  to  the  <tifTerent  regions  shown  in  figure  4.  Contour  levels  are 
.set  at  10  *.  I0'“.  and  10“*.  (o)  Rjc/A„  =  0..‘i2:  (/>)  1.0;  (r)  2.3;  (rf)  3.2. 


shift  occurs  after  the  primary  instability  region  of  the  transition,  and  coincides  with 
the  .second  region  of  suhharmonic  growth.  The  merging  of  the  two  vortices  rt'duces 
the  frequency  of  their  passage  from  to  and  increase's  their  characteristic  size 
from  4  =  2n/k^  to  21^.  'rhe  shifting  of  the  centre  of  the  band  from  A',  to  JA,  represents 
an  increa.se  in  the  i  haracteristic  .size  of  the  v*>rti(!t*.s  and  therefore  provides  a  spatial 
indication  of  the  vortex  merging  process. 

An  important  condition  for  {)arametric  re.sonance  to  take  place  is  that  the 
subharmonic  and  the  fundamental  mtides  travel  at  the  same  phas<'  sjoeed.  Thi.s 
condition  wa.s  as.sunicd  by  Kelly  (1907)  and  [>ostulated  by  Monkewitz  (1988).  Figure 
16  .shows  the  variation  of  the  normalized  pha.se  speeds.  {(o/k)/l\  of  the  fundamental 
and  the  subhartnonic  in  the  downstream  direction.  Notice  the  good  matching 
between  the  two  modes  beyond  /f.r/A,,  =  1.6  (i.e.  Regicm  111).  This  .shows  that  the 
condition  of  wavenumber  re.sonance  is  satisfied  beyond  the  initial  region  of  growth 
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Fi(;rHK  15.  Wayf-rnitiitM-r  sja-ctra  of  the  longrtudinal  fluctuations  at  downstream  locations  that 
I'orrcsjiond  to  the  different  ix';:ions  shown  in  fitjuiv  4. 

of  tlie  subhaniionii-  l  itis  is  in  aftifcinfiit  with  the  retjuifenifiits  of'thi'  attalvsis  <i| 
Kelly  (1907)  aitd  .Mojikeuitz  (1988). 

Monkewitz  (198,S)  shtnved  that  the  am|jlitii(le  of  the  ftitHlameiital  inode  nni.st 
attain  a  critical  value  before  it  becomes  phase  locked  with  the  siiliharinottic  and  the 
growth  rate  of  the  subharmonic  is  modified.  At  this  location  the  phase  sjx'cd  of  the 
subharmonic  is  close  to  tinity.  i.c.  the  pha.se  siHTtl  of  the  neutral  funflamt'utal 
Monkewitz  (1988)  also  showed  that  when  the  fundamental  amplitude  is  larger  than 
or  cfpial  to  the  critical  amplitude,  the  growth  of  the  siibharinonic  and  its  jihasc-  spi'cd 
are  depindent  on  the  jiha.se  difference,  A^5.  between  the  fumlamental  and  the 
subhannonic  (see  figures  ti  and  7  in  Monkewitz  1988).  These  figures  show  that  the 
optima!  conditions  for  the  subharmonic  growth  occur  when  the  phase  difference 
between  the  two  modes  is  <-lose  to  zero.  Monkewitz  also  showed  that  when  the 
fundamental  amplitiKle  is  decreasing,  the  growth  of  the  subharinonir-  is  enhanced 
regardless  of  the  initial  pha.se  difference  (see  figure  9  in  Monkewitz  1988).  The  results 
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of  our  experiments  show  that  when  the  fundamental  reaches  its  critical  amplitude 
the  growth  rate  of  the  subharmonic  is  decreasing,  the  phase  speed  of  the  subharmoinc 
is  not  equal  to  unity  and  the  energy  transfer  between  the  fundamental  and  the 
subharmonic  is  not  efficient.  However,  these  conditions  for  the  subharmonic  growth 
are  met  further  downstream,  beyond  Rx/A^  =  1.6.  Figure  17  shows  the  variation  in 
the  j)hase  difference  between  the  fundamental  and  the  subharmonic  in  the 
downstream  direction.  At  Rx/A„  =  1.0,  where  the  fundamental  amplitude  is  close  to 
the  critical  amplitude,  the  phase  difference  between  the  fundamental  and  the 
subharmonic  close  to  zero.  By  comparison  to  figure  6  of  Monkewitz  (1988).  we  note 
that  at  this  pha.se  difference,  the  growth  of  the  subharmonic  is  suppre.ssed  over  a 
small  distance  before  it  starts  growing  again.  The  reduced  growth  rate  at  Rx/A^  — 
1.0  agrees  with  the  predictions  of  Monkewitz  (1988).  At  Rx/A^—  1.6.  where  the 
fundamental  amplitude  is  equal  to  five  times  the  critical  amplitude,  the  phase 
difference  is  still  clo.<e  to  zero.  By  comparison  with  figure  7  of  Monkewitz  (1988).  we 
note  that  the  subharmonic  growth  is  reduced  at  this  phase  angle.  This  also  agrees 
with  the  reduced  growth  rate  at  this  location  as  seen  in  figure  4.  This  reduced  growth 
is  also  due  to  the  low  bicoherence  (figure  7)  which  indicates  low  coupling  and  large 
jjhase  difference  variations.  Further  downstream,  at  Rx/A^  =  2.3.  the  amplitude  of 
the  fundamental  is  equal  to  twice  that  of  the  critical  amplitude.  The  .subharmonic 
amplitude  is  increasing  significantly.  However  the  fundamental  amplitude  is 
decreasing.  By  oomi)arison  with  figure  9  of  Monkewitz  (1988).  we  note  that  the 
subharmonic  should  be  growing  at  any  phase  difference.  This  result  also  agret's  with 
the  energy  transfer  measurements  that  show  efficient  energy  transfer  from  the 
fundamental  to  the  subharmonic  at  this  location. 


6.  Conclusions 

The  nonlinear  and  parametric  characteristics  that  are  as.soeiated  with  the 
■subharmonic  generation  in  the  transition  to  turbulence  in  plane  mixing  layers  are 
detected  and  quantified.  In  the  primary  in.stability  region  of  the  transition,  all 
instability  mmles.  including  the  subharmonic.  grow  independently.  No  significant 
transfer  of  energy  between  any  modes  is  detected.  Measurements  of  the  power 
spectra  and  the  associated  linear  and  quadratic  tran.sfer  functions  show  that  beyond 
this  initial  region,  the  .spatial  growth  of  the  subharmonic  supports  the  parametric 
approach  which  is  described  by  a  linear  model  with  varying  time  coefficients.  On  the 
other  hand,  the  .same  measurements  show  that  the  growth  of  2f„  and  §/,  is  best 
described  by  the  quadratic  nonlinear  interaction  approach.  The  growth  rate  of  the 
subharmonic  starts  modifying  about  the  location  where  Rx/Ag  ^  1.  At  this  location, 
the  fundamental  amplitude  is  equal  to  0.012r  and  has  not  yet  started  to  saturate. 
This  shows  that  the  parametric  resonance  between  the  fundamental  and  the 
subharmonic  can  start  to  affect  subharmonic  growth  before  the  fundamental  reaches 
its  saturation  amplitude.  However,  the  most  efficient  transfer  of  energy  to  the 
subharmonic  occurs  further  downstream  as  the  fundamental  mode  reaches  an 
amplitude  clo.se  to  its  .saturation  amplitude.  Measurements  also  indicate  that  the 
fundamental,  besides  interacting  with  the  subharmonic.  is  also  engaged  in 
redistributing  its  energy  via  nonlinear  interactions  to  other  components  such  as  the 
first  harmonic  and  the  valleys.  Local  wavenumber  measurements  verify  that 
frequency -wavenumber  resonance  matching  conditions  exist  between  the  fun¬ 
damental  and  subharmonic  in  regions  where  efficient  energy  exchange  to  the 
subharmonic  takes  place.  These  results  are  in  general  agreement  with  theoretical 
models  by  Kelly  (1967)  and  Monkewitz  (1988). 
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ABSTRACT 

This  paper  compares  two  methods  (two-tone  input 
approach  and  random  process  input  approach)  for  the 
estimation  of  harmonic  and  intermodulation  distor¬ 
tions  of  nonlinear  systems.  The  random  input  ap¬ 
proach,  where  a  nonlinear  system  is  nmdekd  by  a 
second-order  Volterra  series,  is  examined  in  terms  of 
its  statistical  properties,  and  its  advantages  and  lim¬ 
itations  over  the  classical  two-tone  input  approach. 
Experimental  results  are  shown  where  these  two  ap¬ 
proaches  are  applied  to  evaluate  second-order  distor¬ 
tion  of  a  loudspeaker  and  to  compare  the  performance 
of  these  approaches  in  terms  of  Volterra  kenels  and 
distortion  factors. 

1.  Introduction 

Harmonic  and  intermodulation  distortions,  which 
are  most  significant  in  high-fidelity  systems,  usually 
result  from  nonlinear  input-output  characteristics  of 
some  component.  The  measurement  of  nonlinear  dis¬ 
tortion,  which  is  the  subject  of  this  paper,  has  tra¬ 
ditionally  been  conducted  by  applying  sinusoidal  sig¬ 
nals  and  measuring  the  extent  to  which  harmonic  or 
combination  tones  are  generated  in  the  output  of  the 
component  under  test.  In  contrast  to  the  sinusoidal 
input  approach  [1],  a  random  input  approach  has  been 
developed  to  analyze  nonlinear  distortion  since  a  ran¬ 
dom  signal  well  approximates  a  frequency-multiplexed 
input  such  as  a  speech  signal  [2],[3].  Now  that  two  dif¬ 
ferent  approaches  are  available  for  nonlinear  distortion 
estimation,  the  following  questions  arise;  Are  the  dis¬ 
tortion  products  estimated  by  the  random  input  ap¬ 
proach  the  same  as  the  ones  estimated  by  the  sinu¬ 
soidal  input  approach?  If  so,  under  what  conditions? 
And  what  are  the  advantages  and  limitations  of  each 
approach?  In  this  paper,  we  address  these  questions 
by  (a)  expressing  these  two  approaches  by  a  Volterra 
series  representation  [4],  (b)  investigating  the  statis¬ 
tical  properties  of  the  transfer  function  estimators  in 


the  random  input  approach,  (c)  comparing  results  (in 
terms  of  Vcdierra  kernels  and  distortion  factors) 
tained  by  both  approaches,  and  (d)  discussing  the  ad¬ 
vantages  and  limitations  of  both  approaches.  FinaHy, 
experimental  results  of  applying  the  two-tone  input  i4>- 
proach  and  random  input  approach  to  quantify  second- 
order  distortion  of  a  loudspeaker  are  presented. 


Consider  a  frequency  domain  Volterra  series  repre¬ 
sentation  of  a  quadratically  nonlinear  time-invariant 
system. 

yUm)  =  Hx{U)XUm) 

•>>sm 

where  X{fm),  YUm),  “d  c(/m)  the  N~poiat  dis¬ 
crete  Fourier  transforms  of  the  input,  output,  and 
model  error,  respectively.  The  second-order  fact(»  of 
intermodulation  distortion  (Dim?)  <kt  frequency  /icb/s 
is  defined  as 

DiM2  =  20IognM2l  =  201og[|^:^^|^|]  [dB]  (2) 

where  Vb(/|  ±fi)  is  the  output  voltage  at  the  frequency 
fi  ±  ft-  This  term  may  be  expressed  in  terms  of  the 
frequency  domain  Volterra  kernels  in  the  form 

Dmi  =  20 log  ( I  1  V'l  ]  [dB)  (3) 

where  Vj  is  the  amplitude  of  both  input  voltages 
(v,(f)  =  Vicoa2rfit  +  V|cos2ir/30-  Likewise,  the 
second-order  harmonic  distortion  factor  Dtn,  which  is 
defined  as 

£>Ha  =  201og[H2}  =  201og[|^^|}  [dB]  (4) 


2.  Quadratic  System  Modeling  with 
Random  Input 
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may  be  expressed  in  terms  of  the  Volterra  kernels  as 

DHa  =  201og(l|^^|H]  [dB]  (5) 

Thus,  the  second-order  distOTtion  factors  of  a  compo¬ 
nent  under  test  can  be  obtained  easily  at  various  fre¬ 
quencies  if  the  linear  transfer  function  and  quadratic 
transfer  function  of  the  component  can  be  estimated 
simultaneously  at  those  frequencies. 

By  minimizing  the  mean  square  error  JE7[|f(/m)P] 
one  can  obtain  an  optimum  estimation  of  the  trans¬ 
fer  functions  as  follows: 

+  E  H,{i,i)E[X-{m)Xii)X{i)]  (6) 

•+i=m 

When  the  system  input  is  a  zero-mean  Gaussian  ran¬ 
dom  signal,  the  linear  and  quadratic  transfer  functions 
are  obtained  separately  and  determined  by  the  various 
spectra  up  to  third  order.  However,  it  was  shown  in 
[5]  that  commonly  used  random  inputs,  such  as  those 
provided  by  a  noise  generator,  cannot  automatically 
be  regarded  as  sufficiently  Gaussian  for  nonlinear  iden¬ 
tification  studies.  Thus,  a  practical  digital  method  of 
estimating  nonlinear  distortion  of  a  system  without  as¬ 
suming  any  particular  input  statistics  was  presented 

13], [6]. 

For  non-Gaussian  inputs,  one  has  to  solve  (6)  and 
(7)  simultaneously  with  respect  to  Hi{m)  and 
These  equations  can  be  rewritten  using  vector  notation 
as  follows: 

r[X*(m)y(m)]  =  E(X*(m)X’'(m)]H(m)  (8) 

where  X(m)  and  H(m)  are  of  matrix  form 
(E[X*(m)X^(m)]  consists  of  second-,  third-,  and 
fourth-order  spectral  moments,  and  H(m)  consists  of 
ffi(m)  and  H2{i,j)).  The  optimal  solution  in  the  mean 
square  sense  is  given  by  [6] 

H(m)  =  E[X*(m)X'^(m)]->E[X*(m)y(m)]  (9) 

Here,  we  assume  we  have  a  set  of  M  realizations 
or  records  for  x(n)  and  y(n)  consisting  of  N  samples 
each.  If  A’t(m),  y*(m),  and  ct(m)  denote  the  DFT’s  of 
yt(”)i  and  ei(n),  respectively,  then  the  optimal 
estimator  of  the  transfer  function  is  given  by 

M  M 

ft(m)  =  Ex:(m)Xr(m)]-M5;x;(m)n(m)]  (10) 

t=l  4=1 


where  Xt(m)  is  of  matrix  form  consisting  of  various 
higher-order  spectral  mements,  up  to  fourth  order, 
formed  by  the  k-th  realization.  Provided  that  Cs(m)  is 
independent  of  Xs(m),  it  can  be  easily  shown  that 

E[H(m)J  =  H(m)  (11) 

Provided  that  es(ni)  is  independent  of  where 

k  :^k',  with  variance  <rf,  the  covariance  nruitrix  of 
H(m)  is  obtained  as  follows: 

M 

cov  H(m)  =  <r*[53x;(m)Xj'(m)]-*  (12) 

4=1 

The  result  (12)  can  be  used  to  determine  how  the  vari¬ 
ance  of  the  estimate  decrease  with  the  number  of  re¬ 
alizations.  Furthermore,  it  can  be  shown  that  the  es¬ 
timator  in  (10)  is  unbiased  and  consistent  if  (a)  there 
is  no  noise  at  the  input  point  (b)  noise  at  the  output 
is  uncorrelated  (c)  the  system  under  consideration  is 
appropriately  modeled. 

3.  Bichromatic  vs.  Random  Process  Estimates 

In  order  to  calculate  the  response  of  the  quadrati- 
cally  nonlinear  system  subject  to  two  sinusoidal  inputs 
with  frequency  /i  and  /j  {fi  >  /j),  let  the  input  be 
3 

*=i  , 

which  can  also  be  expressed  as 

*(0  =  2  S  Vti  exp[;2x/,t]  (14) 

^  (=-3 

where 

V7_.  =  vv*,  y/,  =  o, 

The  linear  response  to  the  inputs  specified  by  (13)  is 
given  by 

3 

Vi(0  =  E  coe(2x/<t  0.-  Mfi)]  (15) 

i=l 

The  quadratic  response  to  the  input  specified  by  (14) 
is  given  by 

=  E  /..)«xpt72r(/..+/..)ll  (16) 

Since  H2{-fu-h)  =  and  = 

^3(/2i  /i)  in  real  systems,  (16)  can  be  expressed  as 

»j(0  =  i.  lV',.)’l«,(/../.)|co.|2»(2/.)«  +  2#,  +  ai{/../.)) 

+  2lv',,||V,,||«,(/„/,)|c«il2>r(/,  +  /,)f  -(■«,  +  «,  + 

+  2|V,,|iV,,|lW,(/„  -/,)|c<»12»(/,  -  h)l  +  #,-«,  + 

+  ElV,.!’//,!/..-/.)  (17) 
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Figure  1:  Block  diagram  of  distortion  analysis  set-up 


functions  are  estimated  according  the  solution  given  by 
(10).  Contour  plot  of  the  estimated  (smoothed 

over  adjacent  frequencies),  accounting  for  second-order 
distortion,  is  shown  in  Fig.2(bottom).  Note  that  it  is 
enough  to  plot  only  the  upper  and  tower  triangular  re¬ 
gion  of  in  the  /<  and  fj  plane  because  of 

various  symmetry  properties.  Values  in  the  upper  tri¬ 
angular  region,  where  /,  >  0,  /j  >  0,  /,  >  fj,  repre¬ 
sent  distortions  at  sum  frequencies,  while  values  at  the 
lower  triangular  region,  where  fi  >  0,  fj  <  0,  /,  >  |/>| 
represents  distortions  at  difference  frequencies.  The 
values  located  on  the  45”  line  in  the  upper  triangu¬ 
lar  region  correspond  to  harmonic  distortion  while  the 
others  correspond  to  intermodulation  distortions. 


The  first  term  in  (17)  denotes  second-order  harmonic 
distortions,  the  second  term  denotes  an  intermodula¬ 
tion  distortion  at  the  sum  frequency,  the  third  term 
denotes  an  intermodulation  distortion  at  the  difference 
frequency,  and  the  last  term  denotes  the  DC  compo¬ 
nent.  Note  that  the  output  at  the  difference  frequency, 
/]  ~  f<2,  contains  not  only  the  intermodulation  distor¬ 
tion  at  that  frequency  but  also  the  second-order  har¬ 
monic  distortion  due  to  frequency  /j  if  fi  -  h  =  2/2 
These  harmonic  and  intermodulation  distortions  (or 
and  can  be  identified  by  sup¬ 

plying  only  one  sinusoidal  signal  first  to  measure  har¬ 
monic  distortion  (or  fi))  and  by  subtracting  the 

harmonic  distortion  from  the  total  distortion  subject 
to  two  sinusoidal  signals  to  obtain  intermodulation  dis¬ 
tortion. 

4.  Experimental  Results 

The  second-order  Volterra  approch  described  in  the 
previous  section  has  been  utilized  to  model,  iden¬ 
tify,  and  quantify  the  linear  response  and  a  second- 
order  distortion  product  of  a  loudspeaker  in  the  low- 
frequency  region.  The  arrangement  of  Fig.  1  wjis  used 
to  measure  the  distortion  performance  of  the  compres¬ 
sion  driver  typical  of  those  used  to  drive  acoustic  horns. 
The  noise  source  used  puts  out  a  flat  noise  spectrum 
bandlimited  to  500  Hz.  A  low  pass  filter  with  a  cutoff 
frequency  of  420  Hz  was  inserted  before  the  amplifier 
which  was  adjusted  to  drive  the  compression  driver  at 
10  W,  A  Digital  Signal  Processor  (DSP  56000)  took 
98,304  data  samples  from  the  input  and  output  signal 
at  a  1  kHz  sampling  rate.  These  were  divided  up  into 
384  segments  of  128  samples  each  to  get  the  spectral 
estimates  in  (10). 

Since  the  input  signal  from  the  noise  generator  to  the 
nonlinear  loudspeaker  system  is  not  sufficiently  Gaus¬ 
sian  [5],  one  has  to  solve  (6)  and  (7)  simultaneously  for 
ffi(m)  and  H^ii^j)-  The  linear  and  quadratic  transfer 


To  compare  the  proposed  technique  with  the  conven¬ 
tional  analog  two-tone  method,  u  measured 

by  the  method  described  in  Section  3.  Two  oscillators, 
both  of  them  having  the  same  amplitude,  are  applied 
to  the  aunplifier.  The  output  of  the  amplifier,  which 
generates  a  sinusoidal  signal  at  10  W,  is  connected  to 
the  component  under  test  to  calculate  the  distortions 
at  sum  frequencies  and  difference  frequencies  by  (17). 
Wj(»\j)  in  Fig  2( top),  measured  by  the  two-tone  input 
method,  shows  good  agreement  with  the  one  obtained 
using  the  random  input  approach  except  for  relatively 
small  fluctuations. 

Distortion  factors  of  the  loudspeaker  can  be  calcu¬ 
lated  by  substituting  the  estimated  transfer  functions 
into  (3)  and  (5).  Reasonable  agreement  between  the 
two-tone  input  method  and  the  random  input  method 
is  obtained  except  for  small  fluctuations.  For  addi¬ 
tional  discussion  see  [3]. 

However,  there  are  trade-offs  between  the  two-tone 
input  approach  and  random  input  approach  on  esti¬ 
mating  second-order  distortion.  In  the  two-tone  input 
approach,  it  is  possible  to  measure  accurate  second- 
order  distortions,  although  measuring  such  distortions 
over  the  band  of  frequencies  of  interest  is  a  tedious  and 
time-consuming  job.  On  the  other  hand,  the  random 
input  approach  does  not  require  repeating  the  same 
experiments  by  changing  the  frequency  of  input  since 
distortions  over  the  band  of  frequencies  of  interest  can 
be  estimated  with  one  data  set.  Another  advantage  of 
the  random  input  approach  is  that  one  can  quantify 
distortion  over  the  entire  frequency  band  using  higher- 
order  coherence  spectra  [3].  Also,  random  signals  are  a 
closer  approximation  to  actual  input  signals,  e.g.,  mu¬ 
sic  and  speech.  The  disadvantage  of  the  random  input 
approach  lies  in  the  fact  that  it  requires  a  large  number 
(e.g.,  hundreds)  of  ensemble  averages  to  obtain  good 
estimates. 
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Figure  2:  Contour  plot  of  the  loudspeaker  quadratic 
transfer  function  ))  experimentally  estimated  us¬ 
ing:  (a)  two-tone  input  method  (top),  (b)  random  in¬ 
put  method  (bottom) 


5.  Summary 

The  results  of  this  investigation  indicate  that  distor¬ 
tion  products  of  a  system  can  be  eatimated  with  rel¬ 
atively  high  accuracy  by  the  random  input  approach 
if  there  are  a  sufficient  number  of  realizations  of  raw 
data,  minimal  input  noise,  no  unmodeled  dynamves, 
and  uncorreiated  output  noise.  The  results  obtained  m 
thjs  paper  can  be  also  easily  extended  to  handle  higher- 
order  (e  g.,  third-order)  distortion  of  the  system  Be¬ 
cause  of  the  generality  of  the  Volterra  approach,  the 
results  of  this  investigrUion  can  be  applied  to  messure 
nonlinear  distortion  introduced  by  any  type  of  system, 
including  audio  systems  and  communication  systems. 
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ABSTRACT 

For  higher-order  Volterra  systems  with  arbitrary 
itiiionary  inputs,  an  orthogonalization  scheme,  based 
upon  ordered  sets  of  conditioned  orthogonal  higher- 
order  input  vectors  in  the  frequency  domain  using 
Gram-Schmidt  method  is  presented.  A  new  set  of  non¬ 
linear  transfer  functions  for  the  conditioned  orthog¬ 
onal  Volterra  system  is  obtained  from  a  coordinate 
transformation  of  a  Volterra  system  based  upon  non- 
orthogonal  higher-order  input  vectors.  The  orthogo¬ 
nal  mode!  has  the  same  minimum  mean  square  error 
IS  the  original  system  but  without  hybrid  coherences, 
thus  allowing  physically  meaningful  interpretation  of 
nonlineaiilies  of  systems. 


1.  Introduction 

In  1961,  Tick[l]  developed  expressions  for  linear 
and  quadratic  frequency-domain  Volterra  kernels  in 
terms  of  cross-power  spectra  and  cross-bispectra,  re¬ 
spectively.  These  results  are  valid  for  Gaussian  excita¬ 
tions  only  and  lead  to  an  orthogonal  Volterra  model. 
In  recognition  the  fsutt  that  many  experimental  signals 
used  in  nonlinear  system  identification  are  not  suffi¬ 
ciently  Gaussian  to  make  use  of  Tick’s  results!  1],  Kim 
and  Powers[2]  developed  an  approach  to  estimate  lin¬ 
ear  and  quadratic  frequency-domain  Volterra  kernels 
which  is  valid  for  general  (i.e.,  nonGaussian  as  well 
as  Gaussian)  random  inputs.  However,  their  model  is 
not  orthogonal.  Thus,  when  using  the  model  to  “pre¬ 
dict”  the  power  spectrum  of  the  response  of  a  nonlin¬ 
ear  physical  system,  one  must  deal  with  ‘interference” 
terms,  corresponding  to  cross-product  interactions  be¬ 
tween  the  outputs  of  the  linear  and  quadratic  compo¬ 
nents  of  the  model,  which  greatly  hinder  a  physical 
interpretation  of  the  model.  For  example,  the  “in¬ 
terference”  terms  may  result  in  a  positive  or  negative 
contributic  j  the  output  power  spectrum,  depend¬ 
ing  on  the  relative  phases  of  the  outputs  of  the  linear 


and  quadratic  components  of  the  model.  Thus,  the 
objective  of  this  paper  is  to  present  an  approach  for 
Volterra  kernel  estimation  which  is  valid  for  general 
random  inputs  while  at  the  same  time  eliminating  the 
presence  of  the  interference  terms  commonly  associ¬ 
ated  with  non-orthogonal  models. 

The  approach  is  based  upon  an  extension  to  higher- 
order  Volterra  systems  of  the  work  done  by  Dodds 
and  Robson[3],  and  Bendat[4],  for  the  nwltiple  in¬ 
put/output  linear  system  identification  problem.  Re¬ 
cently,  Archer  and  Gardner  developed  methods  which 
exploit  cycloetationary  inputs  to  obtain  orthogonal¬ 
ity  properties  associated  with  the  Volterra  kernelslS]. 
Kotenberg  and  Paarmann  studied  ortbognal  search 
method  to  find  significant  model  terms  to  system  iden¬ 
tification  in  the  time  domain  [6]. 

In  this  paper,  the  Gram-Schmidt  orthogonaliza¬ 
tion  procedure  is  utilized  to  generate  conditioned  or¬ 
thogonal  polyspectral  inputs  in  the  frequency  dtmnain 
for  arbitrary  stationary  inputs  with  arbitrary  statis¬ 
tics.  Coordinate  transformations  are  utilized  to  relate 
the  orthogonal  Volterra  kernels  to  the  non-orthogonal 
Volterra  kerneb.  This  allows  one  to  develop  an  orthog¬ 
onal  model  of  the  Volterra  aeries  so  that  each  wder 
term  in  the  series  u  orthogonal  to  the  others.  Thus,  a 
partitioning  of  the  output  power  spectra  and  a  sim¬ 
ple  iterative  representation  of  system  transfer  func¬ 
tions  becomes  possible  without  introducing  interfer¬ 
ence  phenomena. 

We  derive  higher-order  conditioned  auto-  and  cross- 
spectral  moment  matrices  in  terms  of  conventional 
higher-order  spectral  moment  matrices.  We  also 
demonstrate  that  the  optimum  solution  for  the  non¬ 
linear  system  transfer  functions  of  the  conditioned  or¬ 
thogonal  model  has  the  saume  minimum  mean  square 
error  as  the  conventional  nonlinear  system  transfer 
functions  of  a  non-orthogonal  Volterra  model  esti¬ 
mated  using  minimum  mean  square  error  criteria. 
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2.  Conditioned  Orthogonal  Input  Vector  for 
Higher-Order  Volterra  System 

A  higher-order  nonlinear  Volterra  system  can  be  re¬ 
garded  as  a  multiple  inputs/single-output  linear  sys¬ 
tem.  Partial  coherence  between  the  inputs  exist  since 
the  multiple  high-order  inputs  are  generated  from  a 
single  input  time  series.  In  such  cases,  high-order  non¬ 
linear  system  can  be  partially  modeled  as  low-order 
nonlinear  system,  and  vice  versa.  Thus,  it  hinders  a 
physical  interpretation  of  the  system  identification. 

In  order  to  remove  the  partial  coherence  between 
the  multiple  high-order  inputs,  thus  rendering  them 
orthogond,  one  can  utilize  the  Gram-Schmidt  orthog- 
onalization  procedure.  Such  conditioning  of  multiple 
input  vectors  allows  us  to  find  the  system  transfer  func¬ 
tions  straightforwardly,  and  the  relationship  between 
the  transfer  functions  of  the  orthogonal  Volterra  sys¬ 
tem  and  the  non-orthogonal  Volterra  system  can  be 
easily  found. 

A  frequency-domain  rth-order  discrete  Volterra  se¬ 
ries  representation  of  the  nonlinear  system  can  be  writ¬ 
ten  as  follows 

•+>+*=»» 

•l+  +•-=*" 

=  H(m)X(m)  (1) 

where  X  [m)  and  N{m)  axe  the  discrete  Fourier  trans¬ 
forms  of  the  input  z(f)  and  the  noise  n(f),  reepectively. 
The  row  vector  H(m)  and  the  column  vector  X(m)  are 
given  by 

H  =  (2) 

X‘  =  (X,,X2,..,X,J  (3) 

where  the  superscript  't'  is  the  transpose  of  a  ma^ 
true.  In  Eqs.  (2)  and  (3),  the  row  vectors  Hi,  H}, 
and  Hr  are  linear,  quadratic,  and  rth-order  transfer 
functions,  respectively.  The  column  vectors  Xi,  Xz, 
Xr  are  Fourier  transforms  of  the  linear,  quadratic,  and 
rth-order  inputs  xi,  xj,  and  x,,  respectively.  For  ex¬ 
ample,  the  linear  transfer  function  and  input  are  given 
by 


where  0  <  m  <  A/,  and  Af  is  the  Nyquist  frequency. 
The  quadratic  transfer  function  and  input  vccUnts  can 
be  written  as 

Hj(m)  s:  I  ■  ■  ]  (6) 


X,(m)  = 


I(*,j)Xii)XU) 


(7) 


where  f(m  -f  l)/2l  <  «  =  m  -  j  <  Af ,  ff)  is  tl» 
largest  mteger  less  than  I,  =  2  for  t  ^  j,  ud 

=  1  otherwise.  The  cubic  transfer  function  and 
input  vectors  are  given  by[7] 

Vsim)  =  [  }  (8) 


X3(m) 


ni,j,k)X(i)XU)X(k) 


(9) 


where  the  indices  t,  j,  and  k  span 

f(m  -f  2)/3l  =  mi  <  i  <  Af 
(■{m  —  mi  1)/2'1  <  j  <  m,  for  each  t 
k  =:  m  —  i  —  j 

The  symmetry  factor  Hi,j,k)  is 
f  6  if  i^j^k 

\  3  if  two  of  three  indices  are  same 
{  1  iri  =  j  =  k 

For  the  higher-order  input  vectors  defined  above, 
their  polyspectra]  matrices  are  defined  by 

Sji  =  (XjX,^}  (10) 

where  (•)  represents  a  expectation  operator,  and  the 
superscript  ‘A’  denotes  a  Hermitian  of  a  matrix. 

The  alternative  orthogonal  model  can  be  con¬ 
structed  by  replacing  the  original  given  input  records 
by  ordered  sets  of  conditioned  input  records  {Zi  = 
Xi-i-i},  i  =  1,2,  ...,r.  The  dependence  upon  fre¬ 
quency  /  of  functions  is  omitted  for  simplicity  in  nota¬ 
tion.  The  conditioned  input  Z,-  represents  Xi  in  which 
the  linear  effects  of  Xi  to  Xi_i  have  been  removed  by 
the  Gram-Schmidt  orthogonalization  procedure. 


Suppose  a  nonlinear  Volterra  system  with  multiple 
input  vectors  Xi,  Xj,  •  •  •,  Xr,  which  are  given  as  Eqs. 
(^)«  (7),  etc.  Clearly,  the  system  can  be  represented 
with  any  linear  combination  of  these  input  vectors.  Let 

Z,  =  X,  (II) 

Z,  ==  X,-L„Zi  (12) 


Hi(m)  =  J/,(m) 
Xi(m)  =  X(m) 


(4) 

(5) 
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We  wish  to  chooee  Lji  so  that  Zi  and  Zj  are  orthog- 
oaal  in  such  a  way  that  (ZjZ^)  =  0.  Then  one  can 
ihow  that 

Ui  =  {XjZ^XZjZ^)-'  =  SnSu^  (13) 

where  S21  is  the  auto-bispectrum  and  Sn  is  the  auto¬ 
power  spectrum  of  the  input  Xi-  Notice  that  the  ex- 
pressioD  given  for  L31  is  the  same  as  the  transfer  func¬ 
tion  when  Xj  is  modeled  with  Xj .  Thus  one  can  inter¬ 
pret  the  ortbogonalized  input  Z3  as  reprinting  Xj 
in  which  the  linear  effect  of  Xi  has  been  removed. 

Now,  one  can  repeat  the  orthogonalization  proce¬ 
dure  by  adding  a  new  orthogonal  basis  vector  and  ap¬ 
plying  orthogonal  conditions  each  time.  Therefore,  in 
general,  for  an  rth-order  nonlinear  Volterra  system, 
one  can  write  Xj  in  terms  of  conditioned  orthogonal 
input  vectors  Zi,  where  i,  j  =  1, . . . ,  r,  as  follows: 


The  row  vectors  Gj ,  G3,  and  Gr  are  linear,  quadratic, 
and  rth-order  transfer  functions  (or  components)  of 
the  conditioned  orthogonal  system,  respectively.  The 
column  vectors  Z],  Z3,  and  Zr  are  linear,  quadratic, 
and  rth-order  conditioned  orthogonal  high-order  in¬ 
puts,  respectively.Tbe  tranaf<»mation  matrix  L  is  a 
block  unit  lower  triangular  matrix  given  by 


■  III 

■ 

L21 

Ij2 

L  = 

L31 

L32  I33 

.  Ln 

Er2  Ifr 

(19) 


where  (=  Ln)  is  an  identity  matrix,  and  u 
given  in  Eq.  (15).  From  Eq.  (16),  one  can  immediately 
obtain  the  new  set  of  conditioned  orthogonal  transfer 
functions  G, 


X;=^Lj,Z.  (14) 

isl 

with  {ZiZj)  =  0  for  t  ^  j.  Furthermore,  one  can 
easily  obtain 

h,i  =  {XjZtKZiZtr^  (15) 

In  this  alternative  conditioned  system,  one  can  re¬ 
gard  the  noise  term  N  as  the  output  Y  (=  X,+i )  con¬ 
ditioned  on  all  of  the  input  records  Xi  to  Xr,  and 
thus,  is  given  by  Z^+i.  In  this  way,  one  can  extend 
the  system  order  to  r-4- 1  in  Eqs.  (14)  and  (15).  Notice 
that  the  noise  N  and  the  output  Y  of  the  conditioned 
orthogonal  system  remain  same  as  those  of  the  original 
system. 


G  =  HL  (20) 

=  (yZ‘')(ZZ'^>-*  (21) 

where  (ZZ^)  is  a  block  diagonal  matrix  with  ith  di¬ 
agonal  element  (ZiZ^)  Now  the  orthogonal  transfer 
function  G  can  be  easily  calculated  using  the  condi¬ 
tioned  polyspectral  matrix. 

For  a  cubically  nonlinear  Volterra  system,  the  con¬ 
ditioned  orthogonal  transfer  functions  can  be  obtained 
using  Elqs.  (19)  and  (20), 

Gj  =  Hi  -f  H2L31  -f  H3L31 
G2  =  H2  ■+■  H3L32 

G3  =  H3 


3.  New  Higher- Order  Transfer  Functions  for 
Conditioned  Orthogonal  Volterra  System 

As  we  have  new  conditioned  orthogonal  coordinate 
system  with  the  basis  of  Zj,  one  can  obtain  new  con> 
ponents  of  the  conditioned  input  vectors,  i.e.  transfer 
functionsGi,  according  ,0  the  coordinate  transforma¬ 
tion  from  the  system  with  the  basis  Xj  to  the  orthog¬ 
onal  system  with  the  basis  Z,.  Using  the  two  different 
coordinate  systems,  one  can  represent  the  output  V  as 
follows, 


y-N  =  GZ 

=  HX 

=  HLZ 

(16) 

where  the  vectors  G  and  Z  are  given  by 

G  =  [Gi,G2,  ■••,Grl 

(17) 

Z'  =  [Z,,Z2.-  -.Zr] 

(18) 

Note  that  the  cubic  transfer  functions  of  two  alterna¬ 
tive  systems  are  identical.  It  is  also  possible  to  reverse 
order  of  the  input  vectors.  Then,  the  linear  trMsfer 
functions  of  the  two  alternative  systerrvs  become  iden¬ 
tical,  and  the  higher-order  transfer  functions  are  mod¬ 
ified  successively. 

Since,  L„  is  an  identity  matrix,  one  can  rewrite  Eq. 
(20)  for  Hi. 

r 

Hi  =  Gi  -  ^  HjLji  (22) 

For  a  cubically  nonlinear  Volterra  system, 


H3  = 

G3 

H,  = 

G2 

—  H3L32 

H,  = 

Gi  -  H2E2J 

-  H3L3, 
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Thus,  one  c&n  find  the  transfer  functions  of  the  orig¬ 
inal  system  recursively  using  the  new  set  of  transfer 
functions  of  the  conditioned  orthogonal  system. 

In  order  to  find  explicit  expressions  for  the  transfer 
functions  of  the  conditioned  system  using  Eq.  (20),  the 
conditioned  polyspectral  matrices  are  needed  in  terms 
of  the  polyspectral  matrices  of  the  original  input  vec¬ 
tors,  which  will  be  described  in  the  next  section. 

4.  Recursive  Relationship  of  Conditioned 
Polyspectral  Matrix 

A  recursive  relationship  between  the  conditioned 
polyspectral  matrix  can  be  found  by  rewriting  Eq.  (14) 
as  follows, 

X;  = 

rs:l 

k 

=  +  (23) 

t=i 

Then,  one  can  write 

=  (24) 

where  Xk-t-i  =  Zj.  By  taking  an  expectation  after 
multiplying  X<  in  Eq.  (24)  for  j  >  i  >  k,  one  can 
obtain 

Sji  k  —  1  (25) 

where  S^,-  *  is  a  conditioned  polyspectral  matrix  ob¬ 
tained  from  jth-order  and  ith-order  input  vectors  in 
which  the  linear  effects  of  Xi  to  Xt  have  been  re¬ 
moved. 

As  an  example,  for  a  cubic  nonlinear  system,  one 
can  find  the  following  from  Eqs.  (15)  and  (25), 

Lji  =  S2iSfj* ,  Lsi  =  Esj  =  S32  iS22*i 

and, 

S32  1  =  S32  -  S3iSJ‘j*Si2 

8221  =  822  —  8218^1*812 

Using  Eq.  (20)  and  the  above  identities,  we  can  see 
the  conditioned  orthogonal  system  is  equivalent  to  the 
original  non-orthogonal  Volterra  system.  Thus,  if  H 
is  a  set  of  optimum  transfer  functions  in  the  original 
non-orthogonal  system,  then  G  is  also  a  set  of  transfer 
functions  in  the  conditioned  orthogonal  system  with 
the  same  minimum  mean  square  error. 


The  decomposition  method  derived  in  this  paper  m- 
configures  the  partially  coherent  higher-order 
ear  input  vectors  of  a  Volterra  system  into  a  set  of  mn- 
tually  incoherent  input  vectors  so  that  the  total  model 
output  power  consists  of  the  sum  of  the  outputs  due  to 
each  of  its  conditioned  orthogonal  input  vector.  The 
advantage  of  this  method  is  that  the  performance  of 
a  system  model  cam  be  improved  successively  by  in- 
creasing  the  order  of  nonlinearity  of  the  system  model 
without  modification  of  the  results  of  previous  noodel. 
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1.  INTRODUCTION 

Volterra  models  have  been  widely  used  to  identify  nonlinear  systems.  Identifi¬ 
cation  of  nonlinear  systems  using  such  models  involves  estimation  of  the  Vol  terra 
kernels,  which  is  usually  computationally  intensive.  A  simple  algorithm  to  estimate 
the  Volterra  kernels  of  quadratic  nonlinear  systems  with  an  i.i.d.  (independent,  iden¬ 
tically  distributed)  input  has  been  developed  recentlyfl).  But  some  nonlinear  systems 
(for  example,  a  nonlinear  communication  channel[2])  can  not  be  modeled  effectively 
by  a  quadratic  Volterra  series.  For  these  nonlinear  systems,  Volterra  models  higher 
than  second  order  are  required.  The  objective  of  this  paper  is  to  develop  a  simple 
algorithm  to  estimate  the  Volterra  kernels  of  cubically  nonlinear  systems,  when  sub¬ 
jected  to  an  i.i.d.  input.  If  conventional  methods  are  used  to  identify  the  Volterra 
kernels  of  cubic  systems,  estimation  of  higher-order  correlations  with  various  time 
lags  up  to  6th  order  is  necessary.  In  addition,  the  required  calculation  of  an  inverse  of 
the  input  correlation  matrix  is  computationally  intensive,  especially  when  the  system 
time  delay  is  long.  By  using  the  properties  of  higher-order  correlations  of  an  i.i.d. 
signal,  we  can  greatly  reduce  the  complexity  of  the  estimation  of  Volterra  kernels.  A 
simple  algorithm  for  estimating  the  Volterra  kernels  of  cubic  systems  is  derived  and 
presented.  This  algorithm  needs  neither  the  estimation  of  higher-order  correiations 
for  various  time  lags  nor  the  calculation  of  an  inverse  matrix.  It  will  be  shown  that 
the  reduction  of  complexity  is  significant. 

2.  VOLTERRA  MODELING  OF  NONLINEAR  SYSTEMS 

A  third-order  Volterra  model  can  be  written  as  follows: 
y(»»)  *  S  ht(i)x{n  -  0  +  5 

•aiO  t»0  imi 


1 


(1) 


N-1  JV-lN-l 

+  S  H  ki{i,j,k)x{n  -  ^■)x(n  -i)x(n  -  *:)  +  e(n) 

•«o  >»*■  **> 

Where  x(n)  and  y(n)  are  the  input  and  the  output  of  a  cubicaliy  nonlinear  system, 
»nd  k3{i,j,k)  are  the  linear,  quadratic,  and  cubic  Voltcrra  kernels  re¬ 
spectively,  and  c(n)  is  the  additive  noise.  In  this  third-order  Voltcrra  model,  although 
the  relationship  between  the  input  and  the  output  is  nonlinear,  the  relationship  be¬ 
tween  the  output  and  the  filter  coefficients  is  linear.  Therefore,  the  Voltcrra  kernels 
can  be  estimated  by  solving  the  following  equation[3]f4]; 


Rh  =  p  (2) 

where 

h  =  (3) 

R  =  £[x(n)x(n)’-J  (4) 

p  =  £(x(n)y(n))  (5) 

x(n)  (x(n),...,x(n  -  N-f  l),x*(n),...,x*(n -N  +  1), 

x^(n),...,x^(n -TV-f  1)]^  (6) 


It  is  noticed  that  the  input  vector  x(n)  includes  not  only  linear  terms,  but  also 
quadratic  and  cubic  terms.  Therefore,  the  input  autocorrelation  matrix  R  consists  of 
higher-order  moments  of  the  input  up  to  6th  order. 

3.  PROPERTIES  OF  AN  I.I.D.  SIGNAL 

The  i.i.d.  signal  plays  an  important  roll  in  many  engineering  areas.  For  example, 
in  digital  communications,  the  input  signal  is  usually  considered  to  be  i.i.d.  Ther¬ 
mal  noise  and  noise  generators,  which  are  frequently  used  in  acoustics,  can  also  be 
regarded  as  i.i.d. (6).  For  a  zero-mean  i.i.d.  signal,  the  second-order  correlation  can  be 
determined  as  follows: 

where  the  property  F(x(ni  )x(n2)]  =  £(x{n,)]E(i(n2)j  if  n2  for  an  i.i.d.  signal 
has  been  used,  and  p2  is  equal  to  £(x*{n)].  Correlations  higher  than  second-order 
can  be  determined  in  a  similar  manner.  Using  the  above  properties,  it  can  be  shown 
that  the  higher-order  correlations  of  an  i.i.d.  signal  up  to  6th  order  can  be  completely 
characterized  by  p2i  A's.  /<5i  and  pe,  where  p,  denotes  the  ith-order  correlation  of 
x(n),  £{x’(n)]. 

4.  THE  CUBIC  IJ.D.  ALGORITHM 

In  the  general  caise,  the  \'olterra  kernels  of  a  cubic  system  can  be  estimated  by 
solving  Eq.  2.  For  a  cubic  system,  the  input  autocorrelation  matrix  R  includes  higher- 
order  correlations  of  the  input  for  various  time  lags  up  to  6th  order.  It  is  also  noticed 
that  the  estimation  process  involves  calculation  of  the  inverse  of  the  R  matrix,  R"*, 


which  results  in  0{L^)  complexity,  where  L  is  the  total  number  of  filter  coefficienis. 
L  is  usually  very  large,  especially  when  the  order  of  the  nonlinear  system  is  high 
(say,  cubic)  and/or  the  memory  of  the  system  is  long.  Therefore  the  whole  estimation 
process  is  very  computationally  intensive.  Some  research  has  been  devoted  to  relic\  ing 
the  computational  burden.  A  simple  algorithm  for  identifying  quadratic  nonlinear 
systems  with  an  i.i.d.  input  has  been  developedfl).  But  in  some  situations,  the 
result  of  using  quadratic  Volterra  model  is  not  satisfactory.  For  example,  in  digital 
communications,  a  nonlinear  communication  channel  can  be  modeled  by  odd-order 
terms  of  a  Volterra  series[2].  To  talc*,'  such  nonlinear  effects  into  account,  adding  at. 
least  the  cubic  term  is  required.  For  these  nonlinear  systems,  a  simple  algorithm  to 
estimate  Volterra  kernels  of  cubically  nonlinear  systems  is  highly  desirable. 

For  a  cubically  nonlinear  system  with  an  i.i.d.  input,  the  kernel  estimation  process 
can  be  simplified  by  using  the  properties  of  an  i.i.d.  signal  mentioned  in  section  3. 
The  input  correlation  matrix  R  for  such  a  system  can  be  completely  determined  as 
long  as  /ij,  /i3,  /i4,  /Is,  and  na  are  known.  In  addition,  since  most  elements  of  the  R 
matrix  are  actually  zero,  the  R  matrix  can  be  reduced  to  a  sparse  matrix.  Thus,  with 
some  algebra,  a  simple  algorithm  for  estimating  the  Volterra  kernels  of  cubic  systems 
can  be  derived.  In  this  algorithm,  the  cubic  Volterra  kernels  are  estimated  by  the 
following  formula: 

/,  m)  =  E[yin)x{n  -  k)x{n  -  !)x(n  -  m)|//i^,  (S) 

0<i:<l<m<N  —  1 

h3(k,l,l)  =5  {/i3£{y{n)z(n-i)x(n~0]-A<2-£(y(n)z(n-i-)^^(»»-0] 

+/i|£{y(n)x(n  -  i)])/r,  (9) 

0<kJ<N-l,k^l 

where 

r  -  Mifh  -  -  mI)  (10) 

After  the  cubic  Volterra  kernels  are  determined,  the  quadratic  Volterra  kernels  can  be 
estimated  as  follows: 

ksikj)  =  {£[y(n)z(n  -  fc)a:(n -/)) -/i2//3[^3(A%/>0 

(11) 

0<Jb</<iV-l 

Similarly,  the  linear  Volterra  kernels  can  be  determined  as  long  as  the  cubic  and  the 
quadratic  kernels  are  known: 

Ai(fc)  =  {£fy(n)z(n-fc)]-/i^/f3(^)-/i3Mfc,*) 

-fi4h3(k,k,k)}/fi3,  (12) 

0  <*<  Af-1 

where  /f3(k)  is  defined  as  follows: 

ft(*)  =  E 

%acO,it^k 


(13) 


Note  that  in  this  algorithm  the  Volterra  kernels  are  expressed  in  terms  of  //j,  //3. 

/is,  /i«  and  higher-order  cross- correlations  between  the  input  and  the  output.  There 
is  no  need  to  calculate  the  higher-order  correlations  of  the  input  for  various  time  lags. 
Futbennore,  since  there  are  no  matrix  operations  involved,  the  computational  com¬ 
plexity  is  greatly  reduced, 

5.  CONCLUSION 

A  simple  algorithm  for  estimating  the  Volterra  kernels  of  cubic  systems  subjected 
to  an  i.i.d.  input  is  presented  in  this  paper.  One  of  the  applications  of  the  i.i.d. 
algorithm  is  in  the  digital  communication  area,  where  the  input  signal  for  digital  com¬ 
munication  is  usually  considered  to  be  i.i.d. 
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ABSTRACT 

We  present  a  new  orthogonal  approach  for  the  estimation  of 
nonlinear  FRF's  which  is  valid  for  general  random  waves  (i.e., 
nonGaussian  as  well  as  Gaussian)  while  at  the  same  time  re¬ 
moving  the  presence  of  the  interference  terms  associated  with 
non-Gaussian  waves.  The  approach  of  this  paper  is  illustrated 
by  using  it  to  quantify  the  linear  and  ouadratically  nonlinear 
dynamical  response  of  TLP’s  to  random  nonGaussian  sea  wave 
excitation,  and  by  comparing  the  performance  of  the  conven¬ 
tional  noo-orthogonal  Volterra  model  with  that  of  the  new  con¬ 
ditioned  orthogonal  model. 


KEY  WORDS:  TLP's,  Orthogonal  Volterra  System  Model, 
NonGaussian  Interference,  Nonlinear  wave  interaction,  Polyspec- 
tral  Analysis 


INTRODUCTION 

This  paper  is  concerned  with  the  application  of  digital  bis- 
pectral  analysis  techniques  (Powers  and  Miksad,  1987)  to  exper¬ 
imentally  detect  and  quantify  nonlinear  wave  interaction  phe¬ 
nomena  associated  with  quadratic  response  of  marine  structures 
such  as  tension  leg  platforms  (TLP)  in  nonGaussian  random 
seas,  without  “interference’’  of  linear  wave  force  and  quadratic 
wave  drift  force  due  to  the  nonGaussian  nature  of  the  wave  ex¬ 
citation  (S.  B.  Kim  et  al.,  1989).  The  presence  of  the  “inter¬ 
ference"  terms  greatly  hinder  a  physical  interpretation  of  the 
model.  For  example,  the  “interference"  terms  may  result  in 
a  positive  or  negative  contribution  to  the  power  spectrum  of 
the  response  predicted  by  the  model,  depending  on  the  relative 
phases  of  the  linear  and  quadratic  components  of  the  model  (K. 
I.  Kim  et  al.,  1987).  Thus,  the  objective  of  this  paper  is  to 
present  a  new  approach  for  the  estimation  of  nonlinear  FRF’s 


which  is  valid  for  general  random  waves  (i.e.,  BonGaussian  as 
well  as  Gaussian)  while  at  the  same  time  removing  the  presence 
of  the  interference  associated  with  non- Gaussian  srave. 

In  this  paper,  we  derive  conditioned  orthogonal  polyspeetral 
moment  noatrices  of  the  wave  excitation  in  terms  of  conventional 
polyspeetral  moment  matrices  utilising  the  Gram-Schmidt  or- 
thogonalization  procedure  which  do  not  depend  on  the  wave 
statistics.  Sonae  accounts  of  the  useful  ideas  of  conditioned  (or 
residual)  spectra  and  partial  coherence  has  been  given  in  the 
books  of  Jenkins  and  Watts  (Jenkins  and  Watts,  1968)  and  oi 
Bendat  and  Piersol  (Bendat  and  Piersol,  1971).  Such  concepts 
are  generalised  by  Dodds  and  Robson  (Dodds  and  Robaon,  1975) 
for  multivariate  stationary  random  processes.  Bendat  (Bendat, 
1976)  utilised  the  concepts  for  linear  system  identification  from 
multiple  input/output  data.  Even  though  they  developed  the 
ideas  from  a  spectral  density  matrix  from  a  set  of  mutually  in¬ 
coherent  white  noise  sources,  it  is  consistent  with  the  ideas  of 
Gram-Schmidt  orthogonalisatioD  procedure. 

We  generalise  the  concept  conditioned  orthogonal  spectra 
for  nonlinear  system  identification  with  higher-order  Volterra 
system  model.  Such  an  orthogonaliaed  model  is  different  from 
Barrett  (or  Wiener)  orthogonal  series  in  such  a  way  that  the 
latter  is  constructed  from  a  hierarchy  of  functionals  of  increas¬ 
ing  order  that  are  orthogonal  to  each  other  with  respect  to  a 
white  Gaussian  (or  Gaussian)  input  and  whose  sum  character¬ 
ises  the  system  (Barrett,  19^).  In  our  model,  an  orthogonal- 
ixation  procedure  is  applied  to  the  multiple  order  input  vectors 
for  arbitrary  input  statistics.  Using  this  orthogonal  model  of 
the  VoHerra  series,  a  partitioning  ^  the  response  power  spectra 
and  a  simple  iterative  representation  of  FRF’s  becomes  possible 
without  introducing  intetference  phenomena.  We  obtain  the  re¬ 
lationship  between  the  new  FRF’s  of  the  conditioned  orthogonal 
system  and  the  FRF’s  of  the  non-ortbagonal  Volterra  qrstem  for 
general  random  wave  excitation  by  utilising  the  concept  of  co¬ 
ordinate  transformation. 
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FisAlly,  we  illustrate  the  approach  of  this  ps4>er  by  using 
it  to  quantify  the  linear  and  quadratically  nonlinear  dynamical 
lesponse  of  tension  leg  platforms  to  random  nooGaussiao  sea 
wave  excitation,  and  by  comparing  the  performance  of  the  con¬ 
ventional  non-orthogonal  Volterra  model  and  associated  *pre- 
diction”  of  the  TLP  response  power  spectra  with  that  of  the 
new  conditKHied  orthogonal  model  described  in  th»  paper.  Of 
particular  importance  is  the  fact  that  the  conditimied  orthogo¬ 
nal  model  allows  one  to  decompose  the  observed  response  into 
its  Unear  and  quadratic  components  without  the  uncertainty  as¬ 
sociated  vrith  the  nonGaussian  interference  term. 


ORTHOGONALIZATION  OF  THE  HIGHER-ORDER 
INPUT  VECTORS 

This  is  a  generalization  of  multi-input  Unear  system  identifi¬ 
cation  with  conditioned  inputs  carried  out  by  Bendat  (Bendat, 
1976),  to  include  higher-order  Volterra  system  with  arbitrary 
inputs.  The  necessity  to  include  nonGaussian  inputs  in  higher- 
order  Volterra  models  is  discussed  in  the  references  of  K.  I.  Kim 
(K.  I.  Kim  et  al.,  1987)  and  S.  B.  Kim  (S.  B.  Kim  et  al-,  1989), 
and  in  the  collection  of  works  in  the  area  of  nonGaussian  sig¬ 
nal  processing  (Wegman  et  al.,  1989).  Such  generaUzation  us¬ 
ing  multi-input  system  concepts  is  possible  since  one  can  regard 
higher-order  Volterra  series  as  linear  in  terms  of  the  Volterra  ker¬ 
nels  (or  frequency  response  functions),  though  it  is  nonlinear  in 
terms  of  the  input.  In  our  model,  each  higher-order  input  is  re¬ 
garded  as  an  independent  input  so  that  a  higher-order  nonUnear 
time- invariant  Volterra  system  becomes  a  multiple  input/output 
system.  In  this  paper,  since  we  are  dealing  with  quadratic  drift 
response  of  a  TLP  due  to  random  waves,  we  are  going  to  Umit 
ourselves  to  second-order  V'olterra  system,  even  though  one  can 
generalize  the  model  to  any  order  nonUnear  Volterra  system. 

Lets  first  consider  a  nonlinear,  time  invariant  analytic  system 
which  can  be  described  with  a  second-order  Volterra  series, 

y(t)-n(t)  =  J  h,(T)x{i  -  T)dT 

■+  y  y  A}(Tl,t3)*(l  -  T,)l(t  -  r,)dT,dTi  (1) 

where  i{<),  y(t),  and  n(t)  are  the  time  series  of  the  input  wave, 
the  output  response,  and  the  error  (or  noise)  of  the  model,  and 
hifr)  and  Aj(t,,  T2)  are  the  Unear  and  the  quadratic  Volterra 
kernels,  respectively. 

Let’s  look  at  the  system  described  in  Eq.  (1)  in  the  frequency 
domain  by  performing  a  discrete  Fourier  transform.  Then,  the 
second-order  discrete  Volterra  series  representation  of  the  non¬ 
Unear  system  can  be  written  as  follows 

Y{m)  =  H,{m)X{m)+  '£  +  N(m)  (2) 

where  X{m),  Y{m),  N(m),  ffi(m),  are  disoete  Fourier 

transforms  of  i(t),  y(t),  nft),  Ai(r),  Aj(ri,Tj),  respectively. 


Figure  1:  Second-Order  Multiple  Input  System  Model. 


Notice  that  Elq.  (2)  is  Unear  in  terms  of  the  frequency  re¬ 
sponse  functions  ^i(m),  though  the  equation  is  non¬ 

linear  with  respect  to  the  input  wave  X{m).  Thus,  by  regarding 
X^(i).Y(y)  as  a  separate  input,  one  can  treat  the  second-order 
nonUnear  system  as  a  two-input  Unear  system,  as  shown  in  Fig. 
1. 

y(m)  =  f'(m)-HV(m) 

=  H,(m)X,(m)  +  H,(m)X,(m)  +  V(m)  (3) 

where  Y{m)  is  the  model  output.  In  Eq.  (3),  the  Unear  frequency 
response  function  H|(m)  and  the  first-order  input  Xi(m)  are 
given  by 

H,(m)  =  /f,(m)  (4) 

X,(m)  =  X(m)  (5) 

where  0  <  m  <  M,  and  M  is  the  Nyquist  frisquency.  The 
quadratic  frequency  response  fiinction  H|(m)  and  second-order 
input  vector  Xjfm)  can  be  written  as 

H,(m)  »  (  •••//,(•,;)  ••  ]  (6) 

X,(m)  =  l{iJ)X{i)X(j)  (7) 


where  f(m  -f  1)/2'1  <  •  w  m  —  y  <  M,  is  the  largest  integer 
less  than  t,  =  2  for  i  ^  jf,  and  /(»,;)  =*  1  otherwise. 

In  general,  the  two  inputs  Xi(m)  and  Xsfm)  are  not  in¬ 
dependent,  especially  for  nonGaussian  input  where  the  cross- 
coherence  betvieen  the  two  inputs  {Xi(m)X}  (m)),  which  is  the 
auto-bispectruro  of  the  input  wave,  is  not  zero.  The  superscript 
‘A’  denotes  a  Hermitian  of  a  matrix  and  (-)  represents  a  expec¬ 
tation  operator. 

As  shown  in  Fig.  2,  the  alternative  ortbogmsal  ntxidel  can  be 
constructed  by  replacing  the  original  input  records  by  ordered 
sets  of  conditioned  input  records  Z|(m)  and  Z3(m),  where  the 
conditioned  input  Z}(m)  represents  Xjfm)  in  which  the  Unear 
effects  of  Xi(m)  have  bm  removed  by  the  Gram-Sdunidt  or- 
thogonaUzation  procedure. 
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Suppose  a  second-order  Vcdterra  system  vrith  two-input  vec¬ 
tors  X|(m)  and  Xjfm)  which  are  given  as  Eqs.  (5)  and  (7). 


the  origin*]  lyttem. 


N(in) 


Figure  2:  Second-Order  Conditioned  Orthogonal  Multiple  Input 
Syitem  Model. 


Clearly,  the  tyttem  can  be  repreaented  with  any  linear  combi¬ 
nation  of  these  input  vectors.  Let 

Zi(m)  =  X,(m)  (8) 

Zj(m)  =  X2(m) -L„{m)2,(fn)  (9) 

We  wish  to  choose  L^iCm)  so  that  Zi(m]  and  Z](rn)  are  orthog¬ 
onal  in  such  a  way  that  {Z](m)Z*(m)}  =  0.  Then  one  can  show 
that 

loi(m)  =  {X,(m)Zf(m))<Z,(m)Zf(m)>-‘ 

=  S,i(m)S;',‘(m)  (10) 

where  S)i(m)  is  the  auto-power  spectrum  and  S]|(rn)  is  the 

auto-bispectrai  vector  of  the  input  wave  Xi(m),  which  are  given 

as 


Sj,(m) 

Sji(m) 


{X(m)X*(m)> 

(X,(m)2f(m)) 


(H) 


(12) 


where  i  -f-  j  =  m.  Notice  that  the  expression  given  for  Lai(m) 
is  the  same  as  the  frequency  response  function  when  X](m)  is 
modeled  with  Xi(m).  Thus  one  can  interpret  tbeorthogonalized 
input  Zjfm)  as  representing  Xj(m)  in  which  the  linear  efl'ect  of 
Xi(rn)  has  been  removed. 


HIGHER-ORDER  FRF’S  FOR  CONDITIONED 
ORTHOGONAL  VOLTERRA  SYSTEM 

Now,  one  can  obtain  the  new  linear  and  the  quadratic  fre¬ 
quency  response  functions  Ki(m)  and  Kj(m)  from  the  condi¬ 
tioned  orthogonal  coordinate  system  with  the  basis  of  Zi(m) 
and 

Y(m)  =  K,(fn)Z,(m)  K,(m)Z,(rn)  N(m]  (13) 

where  Z](m}  ud  Z](m)  are  linear  and  quadratic  conditioned 
orthogonal  inputs,  respectively.  Notice  that  the  noise  N(m)  of 
the  conditioned  orthogonal  system  remains  the  same  as  that  of 


Conridering  the  orthogonal  properties  of  the  two  conditioMd 
inputs  Z](m)  and  ZsCm),  one  can  find  the  new  FRF's  Kt(m) 
and  Kj(ni)  of  the  condioned  orthogonal  system  using  least  mean 
square  criterion  in  terms  of  the  origina]  input  spectral  matrices 
as  follows 


Ki(m)  «  S,,(m)Sj,'(m)  (14) 

Ks(m)  »s  S,*(m)Si’(”‘)  (15) 

In  Eq.  (15),  the  numerator  §^j(m)  is  the  wave-surge  cross  bit- 
pectrum  where  the  apparent  contribution  due  to  nonGaussian 
quadratic  wave-wave  interference  is  subtracted.  The  denonu- 
nator  SB(m)  is  the  forth-order  polyspectrum  due  to  four  wave 
interactions  where  the  apparent  contribution  due  to  nonGaus¬ 
sian  quadratic  wave-wave  interference  is  subtracted.  Notice  that 
the  fourth-order  spectra]  matrix  $n(m)  consists  of  an  eleraoit 
(X{i)X{j)X{kyX(iy),  where  t  -f  y  «  i  +  1 «  m. 

Thus,  K](m)  is  the  QFRF  with  arbitrary  input  wave  without 
the  apparent  contribution  due  to  nonGaussian  wave-wave  inter¬ 
ference.  Also  note  that  K|(m)  is  identical  srith  the  LFRF  for 
Gaussian  input.  The  QFRF  Ki(m)  of  the  conditioned  orthogo¬ 
nal  model,  which  is  identical  with  the  QFRF  for  non-ortbogonal 
mode),  is  reduced  to  the  QFRF  for  Gaussian  modd  when  the 
bispectrum  Si](m)  (or  Ss)(m))  is  scro. 

One  can  relate  the  new  FRF's  Ki(m)  and  K2(m)  of  the  con¬ 
ditioned  orthogonal  system  with  the  FRF’s  H|(m)  and  H}(m)  of 
the  Voltena  system  with  nonGaussian  inputs,  according  to  the 
coordinate  transformation  from  the  system  with  the  basis  Xi(m) 
and  Xt(m)  to  the  orthogonal  system  with  the  basis  Z|{m)  and 
Z](m).  For  a  quadratically  nonlinear  Volterra  system,  one  can 
obtain  the  conditioned  orthogonal  fiequency  response  functions 
using  Eqs.  (8)  ~  (10), 

Ki(m)  =  Hj(m)  +  Hj(Tn)L»i(m) 

Kr(m)  =  H,(m)  (16) 

Note  that  the  QFRF’s  of  two  alternative  quadratic  Volterra  sys¬ 
tems  are  identical.  Using  the  Eqs.  (10)  (12),  (14)  and  (15), 

one  can  find  the  expression  for  the  linear  and  quadratic  model 
power  s:v«-tr*  Pi(m)  and  Pj(m)  for  the  orthogonal  Volterra 
system  as  follows 

P,(Tn)  =  S,T(m)Sr,'(m)S„(m)  (17) 

P»(r«)  -  S,,(m)S;,*('n)§,*,(m)  (18) 

The  the  total  model  power  spectrum  5„(m)  for  the  orthogonal 
Volterra  model  becomes 

S„(m)  =  Pi(m)  +  P,(m) 

=  H,(m)S„(m)H*(m)  +  H,(m)S„(m)H*(m) 

+  2R«|H,(m)S„(m)H,*(m)) 

=  Pr(m)  +  Pp(m)  +  PtQ(ni)  (19) 

which  is  the  same  a*  the  total  model  power  spectnun  for  the  non- 
orthogonal  Volterra  model.  Thus,  two  alternative  systems  have 
the  same  model  error  power,  and  both  are  optimum  sirttein*.  In 
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E<}.  (19),  Pi,(»n),  Pg(»»»)  4od  Ptg(m)  are  the  liaew’,  <{u4Mir*tic, 
and  hybrid  ooodel  power  spectra  for  the  non-ortbogonal  Volterra 
system,  respectively. 

The  eoherence  functions,  which  are  defined  as  the  ratios  of 
the  model  power  spectra  to  the  measured  output  power  spec¬ 
trum,  are  then  calculated  using  Eqs.  (17)  ~  (19), 

Ti(»n)  *=  7?(»n)+7j(m) 

»  7l(”»)  +  7g(>«)  7lo("»)  (iO) 

where  7f(m}  and  Tj{m)  are  the  tinear  and  the  quadratic  coher¬ 
ence  functions  of  the  conditioned  orthogonal  system,  and  7i(m), 
7j(m),  and  72g(m)  are  the  linear,  quadratic,  and  hybrid  coh- 
emece  functions  for  the  non-orthogona!  Volterra  system,  respec¬ 
tively. 

Notice  that  the  expressions  of  Pi(m)  and  P](m)  in  Eqs.  (17) 
and  (18)  are  positive  definite  so  that  the  linear  and  the  quadratic 
coherence  ftmctions  7?(m)  and  7j(m)  of  the  conditioned  or¬ 
thogonal  lystun  are  bounded  between  xero  and  one.  However, 
for  non-ortbogonal  system,  the  boundoess  is  not  guarantoed  so 
that  the  linear  and  the  quadratic  coherence  functions  and 

7^(m)  of  the  non-ortbogonal  system  can  be  greater  than  one. 
Furthermore,  the  hybrid  coherence  function  can  be  neg¬ 

ative  depending  upon  the  phase  of  Hi(m)S]|(m)H^(m).  Such 
chaiacteristics  of  coherence  functions  of  non -orthogonal  system 
cause  great  difficulties  in  meaningful  physical  interpretation  of 
the  analysis  results. 


EXPERIMENTAL  RESULTS 

In  this  section  we  co'npare  the  results  of  the  polyspectral 
analysis  using  non-ortbogonal  second-order  Volterra  model  with 
the  results  of  the  analysis  using  orthogonal  Volterra  model.  The 
experimental  data  we  obtained  from  a  scaled  (1:54)  model  test  of 
a  prototype  TLP  anchored  in  1500  feet  of  water  and  is  described 
elsewhere  (S.  B.  Kim  et  al.,  1989).  The  TLP  was  excited  using 
45°  uni-directional  inegular  wave  of  significant  wave  height  40 
feel.  Waves  were  generated  according  to  the  Pierson-Moskowitz 
spectrum.  The  wave  and  surge  data  were  sampled  at  a  rate  of 
17  Hz  which  resulted  in  a  data  record  of  4260  samples.  Since 
our  major  interest  was  in  the  low-frequency  resonance  drift  os¬ 
cillation  of  TLP  surge,  we  resampled  the  data  using  a  reduced 
sampling  rate  of  2.125  Hz  for  the  higher-order  spectral  analysis. 

We  analyzed  the  surge  response,  which  contains  both  linear 
wave  response  around  0.5  Hz  and  the  response  due  to  quadratic 
difference  frequency  interactions  at  the  surge  resonance  band 
around  0.07  Hz  (S.  B.  Kim  et  al.,  1989),  using  the  two  alterna¬ 
tive  system  model.  In  Fig.  3,  we  present  the  linear  and  quadratic 
coherence  functions  of  wave-surge  time  series  of  the  TLP  using 
the  non-ortbogonal  model  (Fig.  3-a)  and  the  conditioned  or¬ 
thogonal  model  (Fig.  3-b).  Overall  features  of  the  two  plots  are 
similar  in  the  sense  that  both  the  quadratic  coherence  functions 
7q(/)  of  Ibe  non-ortbogonal  model  and  7](/)  of  the  conditioned 
orthogonal  model  are  quite  high  in  the  frequracy  bands  below 
0.2  Hz  and  above  0.8  Hz,  whereas  the  linear  coherence  functions 


7i(/)  and  7i'(/)  ate  negligible  in  these  tame  frequency  bands 

On  the  other  band,  the  linear  eobemice  functions  7^(/)  of 
the  non-orthogonal  OKxitd  and  7f(/)  of  the  conditioned  orthog¬ 
onal  model  are  cfose  to  unity  in  the  wave-excitatioi]  band  oen- 
tered  at  approximately  0.5  Hz  Bowewer,  tbere  are  frequency 
aoocs  where  7[(/)  and  7j(/)  m  Fig.  J-a  are  larger  than  one 
7ig{/}  >•  negative.  FWthermore,  there  is  targe  variation 
of  coherence  with  frequency.  Thoee  behaviors  of  the  cahereoce 
functions  obtained  using  a  noo-orthogonal  model  greatly  hin¬ 
ders  physical  interpretatioci  of  the  analysis  results.  In  the  case 
of  the  conditioned  orthogonal  model,  as  one  can  see  in  Fig.  S-b, 
tbe  linear  and  quadratic  coherence  fimctions  7f(/)  and  7|(/) 
are  bounded  between  zero  and  one  at  all  fr^uencies  and  there 
is  no  hybrid  coberenoe  due  to  nonCaussian  interferenoe.  They 
are  also  much  more  stable,  as  one  can  see  in  the  figure. 


Figure  3;  TLP  Surge  Model  Coherence  Functioos.  a)  Non- 
Orthogonal  Model,  b)  Conditiooed  Orthogonal  Model 

In  Fig.  4,  we  compare  the  linear,  the  hybrid  and  tbe  quadratic 
model  power  spectra  Ptif),  Pu)(f),  and  Pgif)  of  wsve-suige 
time  series  of  the  TLP  using  the  non-orthogonal  model  (Fig.  4-a) 
with  the  linear  and  tbe  quadratic  model  power  spectra  P|(/)  and 
Pi{f)  of  the  conditioned  orthogonal  model  (Fig.  4-b).  In  Fig. 
4-a,  the  abaolute  value  of  the  hybrid  power  spectrum  iFto(/)l 
is  plotted  due  to  tbe  fact  that  it  is  negative  at  some  frequencies. 
Tbe  magnitude  of  the  hybrid  power  spectrum  cannot  be  greater 
than  the  sum  of  the  linear  and  quadratic  model  powers,  however, 
it  is  certainly  greater  than  the  linear  model  power  within  tbe 
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Frequency  [Hz] 


Figure  4;  TLP  Surge  Model  Power  Spectr*.  •)  Non-Orthogonal 
Model,  b)  Conditiooed  OrtbogoniJ  Model 


frequency  band  where  quadratic  interactions  are  dominant  in  our 
example.  On  the  other  hand,  we  don’t  see  such  ill  effects  in  the 
model  power  spectra  estimated  using  the  conditioned  orthogonal 
model  shown  in  Fig.  4-b,  though  the  total  system  model  power 
spectra  of  the  two  systems  are  identical  as  we  have  proved  in 
the  previous  section. 

As  we  discussed  earlier,  the  successful  outcome  of  the  con¬ 
ditioned  orthogonal  model  results  from  the  fact  that  the  appar¬ 
ent  nonGaussian  interference  effects  are  assigned  in  such  a  way 
that  no  hybrid  interactions  between  the  linear  and  the  quadratic 
models  exist  utilizing  the  orthogonal  procedure,  whereas  in  the 
non-ortbogonal  model  the  effects  are  shared  by  both  the  linear 
and  the  quadratic  system  models.  In  fact,  there  is  no  unique 
way  to  assign  such  nonGaussian  interference  effects.  However, 
we  follow  the  idea  that  no  results  from  the  linear  system  anal¬ 
ysis  should  be  changed  due  to  the  effects,  which  one  can  see 
from  the  fact  that  we  have  the  same  expression  of  the  LFRF 
Kj(/)  as  the  LFRF  obtained  assuming  linear  system  model  only 
(i.e.,  Ha(/)  =  0).  Thus,  one  can  successively  improve  the  sys¬ 
tem  model  by  adding  higher-order  terms  in  the  Volteira  system 
without  modifying  the  lower  order  FRF’s. 


By  utilizing  digital  polyspectral  analysis  techniqiies  inocwpo- 
rated  svith  higher-order  Voiterra  system  models,  ssc  are  able  to 
identify  the  effecu  on  the  raspoose  of  a  TLP  due  to  the  nooGaus- 
sian  interference  cd  wave-wave  mteractioBs.  Tbit  was  passible 
with  the  new  tools  we  have  developed  to  remove  the  interferenoe 
effects  by  conditioning  the  input  wave  so  that  each  higher-order 
input  wave  vectors  are  orthogonal  each  other.  We  derived  the 
new  conditioned  input  vectors  and  their  polyspectra  using  origi¬ 
nal  input  polyspectral  matrix,  and  the  rdatioaship  between  the 
FRF's  of  the  orthogonal  system  and  of  the  non-orthognal  sys¬ 
tem  is  obtained  using  the  ooaoept  of  coordinate  transformation. 
The  application  to  the  scaled  noodel  TLP  test  experimental  data 
yielded  improved  results  over  the  non-orthogonal  model  without 
ill  effects  due  to  nonGaussian  interference. 
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